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Abstract 

In this paper we develop a measure-theoretic method to treat problems in hypergraph theory. 
Our central theorem is a correspondence principle between three objects: An increasing hypergraph 
sequence, a measurable set in an ultraproduct space and a measurable set in a finite dimensional 
Lebesgue space. Using this correspondence principle we build up the theory of dense hypergraphs 
from scratch. Along these lines we give new proofs for the Hypergraph Removal Lemma, the 
Hypergraph Regularity Lemma, the Counting Lemma and the Testability of Hereditary Hypergraph 
Properties. We prove various new results including a strengthening of the Regularity Lemma and 
an Inverse Counting Lemma. We also prove the equivalence of various notions for convergence of 
hypergraphs and we construct limit objects for such sequences. We prove that the limit objects are 
unique up to a certain family of measure preserving transformations. As our main tool we study the 
integral and measure theory on the ultraproduct of finite measure spaces which is interesting on its 
own right. 
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1 Introduction 



The so-called Hypergraph Regularity Lemma (Rodl-Skokan |14|, Rodl-Schacht fT3\, Gowers ||5l, 
later generalized by Tao fl7 |) is one of the most exciting result in modern combinatorics. It exists 
in many different forms, strength and generality. The main message in all of them is that every k- 
uniform hypergraph can be approximated by a structure which consists of boundedly many random- 
looking (quasi-random) parts for any given error e. Another common feature of these theorems is 
that they all come with a corresponding counting lemma fl2] which describes how to estimate the 
frequency of a given small hypergraph from the quasi-random approximation of a large hypergraph. 
One of the most important applications of this method is that it implies the Hypergraph Removal 
Lemma (first proved by Nagle, Rodl and Schacht [12]) and by an observation of Solymosi llT6l 
it also implies Szemeredi's celebrated theorem on arithmetic progressions in dense subsets of the 
integers even in a multidimensional setting. 

In this paper we present an analytic approach to the subject. First, for any given sequence of 
hypergraphs we associate the so-called ultralimit hypergraph, which is a measurable hypergraph 
in a large (non-separable) probability measure space. The ultralimit method enables us to convert 
theorems of finite combinatorics to measure theoretic statements on our ultralimit space. In the 
second step, using separable factors we translate these measure-theoretic theorems to well-known 
results on the more familiar Lebesgue spaces. 

The paper is built up in a way that these two steps are compressed into a correspondence 
principle between the following three objects 

1. An increasing sequence {Hi}°li of fc-uniform hypergraphs 

2. The ultraproduct hypergraph H C X'^. where X is the ultraproduct of the vertex sets. 

3. A measurable subset W C [0, 1]^''^^. 

Using this single correspondence principle we are able to prove several results in hypergraph theory. 
The next list is a summary of some of these results. 

1 . Removal lemma: We prove the hypergraph removal lemma directly from Lebesgue's density 
theorem applied for the set W C [0, 1]^ In a nutshell, we convert the original removal 
lemma into the removal of the non-density points from W which is a 0-measure set. (Theorem 

m 

2. Regularity lemma We deduce the hypergraph regularity lemma from a certain finite box 
approximation of W in Li. To be more precise, W is approximated by a set which is the 
disjoint union of finitely many direct product sets in [0, 1]^ ( Theorem|2]l 

3. Limit object We prove that W serves as a limit object for hypergraph sequences {Hi}^^ 
which are convergent in the sense that the densities of every fixed hypergraph F converge. 
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Limits of /c-uniform hypergraphs can also be represented by 2*^ — 2 variable measurable func- 
tions w : [0,1]^ [0,1] such that the coordinates are indexed by the proper non-empty 
subsets of {1, 2, . . . , fc} and w is invariant under the induced action of Sk on the coordinates. 
This generaUzes a theorem by Lovasz and Szegedy. ( Theorem|2ll 

4. Sampling and concentration: Even tough is a measurable set, it makes sense to talk about 
random samples from W which are ordinary hypergraphs. We prove concentration results for 
this sampling process. The sampling processes give rise to random hypergraph models which 
are interesting on their own right. ( Theorem[TT|and Theorem [TZt 

5. Testability of hereditary properties: We give a new proof for the testability of hereditary 
hypergraph properties (This was first proved for graphs by Alon-Shapira and later for hyper- 
graphs by Rodl-Schacht). The key idea is based on a modified sampling process from the limit 
object W that we call "hyperpartition sampling". This creates an overlay of samples from W 
and the members of the sequence {Hi}^^^ such that expected Hamming distance of Hi and 
the corresponding sample is small. (Theorem[8]l 

6. Regularity as compactness: We formulate a strengthening of the hypergraph regularity 
lemma which puts the regularity in the framework of compactness. Roughly speaking this 
theorem says that every increasing hypergraph sequence has a subsequence which converges 
in a very strong (structural) sense. Here we introduce the notion of strong convergence. 
(TheoremlUi 

7. Distance notions: We introduce several distance notions between hypergraph limit objects 
(and hypergraphs) and we analyze their relationship. (TheoremfTOll 

8. Uniqueness: We prove the uniqueness of the limit object up to a family of measure preserving 
transformations on [0, 1]^ This generalizes a result of Borgs-Chayes-Lovasz from graphs 
to hypergraphs. (Theorem|9]l 

9. Counting Lemma: We prove that the structure of regular partitions determine the 
subhypergraph densities. (Theorem[T3]and Corollarv l4.2l i 

10. Equivalence of convergence notions: We prove that convergence and strong convergence are 
equivalent. For technical reasons we introduce a third convergence notion which is a slight 
variation of strong convergence and we call it structural convergence. This is also equivalent 
with the other two notions. The third notation enables us to speak about structural limit objects 
which turns out to be the same as the original limit object. (Theorem[T4]l 

1 1 . Inverse counting lemma: Using the equivalence of convergence notions we obtain that if two 
hypergraphs have similar sub-hypergraph densities then they have similar regular partitions. 
In other words this means that regular partitions can be tested by sampling small hypergraphs. 
(Corollary |4j]l 
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Remark: In our proofs we use the Axiom of Choice. However, Godel in his seminal work The 
Consistency of the Axiom of Choice and the Generalized Continuum Hypothesis with the Axioms of 
Set Theory proved that (see also |4]): If T is an arithmetical statement and F is provable in ZF with 
the Axiom of Choice then F is provable in ZF. In fact, Godel gave an algorithm to convert a formal 
ZFC-proof of an arithmetical statement to a ZF-proof. An arithmetical statement is a statement in 
the form of 

(Qia;iQ2a;2 . . . QkXk)P{xi,X2, . . . ,Xfc) , 

where the Q/s are existential or universal quantifiers and the relation P{xi,X2, ■ ■ ■ ,Xk) can be 
checked by a Turing machine in finite time. The reader can convince himself that the Hypergraph 
Removal Lemma, The Hypergraph Regularity Lemma, the Counting Lemma and the Inverse Count- 
ing Lemma are all arithmetical statements. 

Acknowledgement: We are very indebted to Terence Tao and Laszlo Lovasz for helpful discus- 
sions. 

2 Preliminaries 

2.1 Homomorphisms, convergence and completion of hypergraphs 

Let Hk denote the set of isomorphism classes of finite fc-uniform hypergraphs. For an element 
H G Hk we denote the vertex set by V{H) and the edge set by E{H). In this paper we view a 
/c-uniform hypergraph H on the vertex set as a subset of V'^ without having repetitions in the 
coordinates and being invariant under the action of the symmetric group Sk ■ Let wi , U2 , W3 , . . . , W| v | 
be the elements of V. Then an edge E G E{H) is a subset of fc-elements {vi^ , , . . . , w^^. } C V 
such that [vi^ ,Vi2, . . . ,Vi^) G H. If L is a family of edges in H, then L denote the set of elements 
{vi^ ,Vt^, . . . ,Vi^) e H such that {v^^ ,Vt^, . . . ,Vi^} e L. 

Definition 2.1 A Iiomomorpliism between two elements F,He Hk is a map f : V{F) ^ V{H) 
which maps edges of F into edges of H. We denote by hom(_F, H) the number of homomorphisms 
from F to H and by hom°(i^, H) the number of injective homomorphisms. An induced liomomor- 
phism is a map f : V{G) »— > V{H) which maps edges to edges and non edges to non edges. 

Note that in the definition of horn the map V{F) V{H) does not have to be injective but the 
definition implies that it is injective if we restrict it to any edge of F. There is a simple inclusion- 
exclusion type formula which computes hom" from hom. To state this we will need some more 
definitions. 

Let V ^ {-Pi, P2, • • ■ , Ps} be a partition of V{F) and let / : V{F) ^Vhe. the function which 
maps each vertex to its partition set. We define a hypergraph F{'P) whose vertex set is V and the 
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edge set is f{E{F)). Note that F{V) is a fc-uniform hypergraph if and only if every partition set 
intersect every edge in at most 1 element. We define the height h{V) of V as — \P\- 

Lemma 2.1 If F and H are k-unifonn hypergraphs then 

hom(F, H)=Y^ hom\F{V),H) 

V 

and 

hom°(F, B) = ^(-l)''^^^ hom(F(P), H) 

V 

where V runs through all partitions ofV{F) and \iOiii{F{V), H) and \iOTa\F{V), H) are defined 
to be if F{V) is not k-unifonn. 

Proof. The first equation is obvious from the definitions. It implies that for any partition P we have 
that 

hom{F{P),H) = hom° {F{P'),H) 

V'<V 

where the sum runs through all partitions V' such that Pisa refinement of V' . The inversion formula 
for the partition lattice yields the second equation. 
Now we are ready to prove the next lemma. 

Lemma 2.2 If Hi, H2 G Tik ore two hypergraphs such that hom(i^, Hi) — hom(i^, H2) for every 
element F e Tik then Hi and H2 are isomorphic. 

Proof Lemma |2T] implies that horn" (i^, Fi) = hom°(i^, iJ2) for all hypergraphs G Hk- 
In particular liom"(i?i,i?2) = hom°(i?i, > and hom°(i?2, = hom°(i?2, i?2) > 
which implies that \V{Hi)\ = \V{H2)\ and \E{Hi)\ = \E{H2)\. We obtain that every injective 
homomorphism from Hi to H2 is an isomorphism. Since such a homomorphism exists the proof is 
complete. I 

The next two definitions will be crucial. 

Definition 2.2 The homomorphism density t(F, H) denotes the probability that a random map 

f : V{F) V{H) is a homomorphism. It can also be defined by the equation 

hom(F,g) 
^ \ViH)\\v(P)\' 

We also define ti^diF, G) which is the probability that a random map f : V{F) y{H) is an 

induced homomorphism. Finally t9^^(^F, H) denotes the probability that a random injective map is 

an induced homomorphism. 

Definition 2.3 A t-fold equitable blowup of a hypergraph H £ Hk is a hypergraph H' which is 
obtained by replacing each vertex of H by t new vertices and each edge of H by a complete k-partite 
hypergraph on the corresponding new vertex sets. 
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It is clear that if H' is a t-fold equitable blowup of H then hom(F, H') = honi(F, 
and consequently t{F^ H) = t{F, H'). The next lemma shows that hypergraphs from TLk are "essen- 
tially" separated by homomorphism densities except that equitable blowups of a hypergraph cannot 
be separated. 

Lemma 2.3 Let Hi , H2 £ TY^ be two hypergraphs and assume that t{F, Hi ) — t{F, H2 ) for every 
F G Tifc. Then there exists a H dz Tik which is an equitable blowup of both Hi and H2. 

Proof Let H'l be the \V{H2)\-fo\A equitable blowup of Hi and let H'2 be the |F(iJi)|-fold 
equitable blowup of H2. Then 

\V{H'^\^\V{H'2)\ = \V{Hi)\\V{H2)\ 

and t{F, H'l) = t{F, H'2) for every F e Uk- We obtain that 

\vom{F,H'i) ^t{F,H'i)\V{H'i)\^^^^^\ = i/^)|F(i72)'|'^*-^" =hom(F,i/^) 

for every F E Hk- By Lemma |272l the proof is complete. I 
The previous lemma motivates the following definition 

Definition 2.4 Two hypergraphs Hi,H2 G Hk will be called density equivalent ;/ there exists 
H e Hk which is an equitable blowup of both Hi, H2 or equivalently, by Lemma |Z3] t{F, Hi) = 
t{F, H2) for every F e Hk- 

Homomorphism densities can be used to define two convergence notions on the set Hk which 
are slight variations of each other. 

Definition 2.5 A hypergraph sequence {Hi}°^i in Hk is called convergent // 

lim t{F,H,) 

i — >QC 

exists for every F G Hk- We say that {Hi}'^i is increasingly convergent if it is convergent and 

lim \ViHi)\ = 00. 

i — ^00 

Both convergence notions lead to a completion of the set Hk- We denote the first completion by 
Hk and the second one by Hk- These two spaces are very closely related to each other. It will turn 
out that Hk is arc-connected whereas Hk is the union of Hk with the discrete topology and Hk- In 
the space Hk the set Hk behaves as a "boundary" for the set Hk- An advantage of the set Hk is 
that it directly contains the familiar set Hk of hypergraphs. A disadvantage of Hk is that it is not 
connected. On the other hand Hk is connected and fc-uniform hypergraphs are represented in it up 
to dense equivalence. In this paper we focus only on Hk so we give a precise definition only of this 
space. 
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Let S be the following metric on Hk- For two elements Hi,H2 E Tik we define S{Hi, H2) as 
the infimum of the numbers e > for which \t{F, Hi) - t{F, H2)\ < e holds for all F e Hk with 
I < Two hypergraphs have (5-distance zero if and only if they are density equivalent. We 
denote the completion of this metric space by Hk- 

The elements of the space Hk have many interesting representations. We give here one which is 
the most straightforward. Let A4k denote the compact space [0, 1]^*=. Every graph H e Hk can be 
represented as a point in Mk by the sequence T{H) = {t{F, H)}Fe'Hk- Lemma |23] the point 
set T{Hk) represents the density equivalence classes of fc-uniform hypergraphs. The closure of 
T{Hk) in TMfc is a representation of Hk- This representation shows immediately that Hk is compact 
since it is a closed subspace of the compact space Mk- To see that Hk is arc-connected requires 
some more effort, but it will follow easily from one of our results in this paper (Theorem]?]) 

An important feature of the space Hk is that it makes sense to talk about homomorphism densi- 
ties of the form t{F, X) if X G Hfc and e Hk- 

We will denote by [n] the set {1, 2, . . . , n). For a subset B C [fc], r{B) will stand for the 
non-empty subsets of B. Similarly, k) will denote the set of all non-empty subsets of [n\ 

having size at most k. If ii' is a hypergraph on [n] and H C V^^"^ is a fc-uniform hypergraph then 
T{K, H) C V^"^ denotes the {K, iJ) -homomorphism set, where X2, . . . , Xn) e T{K, H) if 
1 ^ xi, 2 — > X2, . . . , n — > x„ defines a homomorphism. Clearly \T{K, H)\ = hom{K, H). For a 
subset E C [n], \E\ = fc let Pe : l^'"' be the natural projection and P^^ : V^''^ be 

the natural bijection associated to a bijective map se '■ [k] E. Then it is easy to check that 

EeE(K) 

Similarly, Ti^diK, H) c T^'"' denotes the {K, _ff)-induced homomorphism set, 
where (xi, 0:2, ... , x„) £ Tind{K, H) if 1 ^ xi, 2 — > X2, . . . ,n ^ Xn defines an induced homo- 
morphism. Then 

EeE(K) E'£E{K)'= 

where H'^ denotes the complement of H in the complete hypergraph on the set V - A simple 
inclusion-exlusion argument shows that if a hypergraph sequence is convergent, then for 

any fc-uniform hypergraph F the sequence {tind{F, Hi)}°^^ is convergent as well. 

2.2 The Removal and the Regularity Lemmas 

First we state the Removal Lemma. 

Theorem 1 (Hypergraph Removal Lemma) For every k-uniform hypergraph K and constant e > 
there exists a number 5 — S{K, e) such that for any k-uniform hypergraph H on the node set X 
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with t{K, H) < S there is a subset L of E{H) with L < e('^') such that t{K, H \ L) = 0. (SS^j. 

Now let us turn to the Regularity Lemma. Let X be a finite set, then Kr{X) C X^' denotes the 
complete r-uniform hypergraph on X. An Z-hyperpartition 7i is a family of partitions Kr{X) = 
u'j^iP^, where P,^ is an r-uniform hypergraph, for 1 < r < fc. We call H (5-equitable if for any 

1 < r < fc and I < i < j < I: 

\\p:\ ~~ iPiii ^ ^ 



\Kr{X)\ 

An /-hyperpartition TC induces a partition on Kk{X) the following way. 

• Two elements a,b E Kk{X), a = {ai, a2, . . . , Ofc}, 6 — {bi, b2, ■ ■ ■ ,bk} are equivalent if 
there exists a permutation cr G Sk such that for any subset A — {ii, 12, ... , i\A\} C [fc], 
{a^,,a^^,. . . , a,|^|} and &o-(j2)7 • ■ ■ : are both in the same P^^^ for some 1 < 

J<1- 

It is easy to see that this defines an equivalence relation and thus it results in a partition Uj^iCj 
of Kk{X) into H-cells. A cylinder intersection L c Kr{X) is an r-uniform hypergraph de- 
fined the following way. Let Bi, i?2,. . . Br be {r — l)-uniform hypergraphs on X, then an r-edge 
{ai, 02, ... , ttr} is in L if there exists a permutation t E Sr such that 

{aa(i),a<T(2), ■ ■ • ,a<T(i-i),a<T(j+i), • ■ ■a<T(r)} e B, forany 1 < i < r . 

As in the graph case, we call an r-uniform hypergraph G e-regular if 

\G\ \GnL\ 



\KriX)\ \L\ 

for each cylinder intersection L, where \L\ > e\Kr{X) \ . Now we are ready to state the hypergraph 
regularity lemma for fc-uniform hypergraphs (see 0, Q, ifTSll . lfT4l . ifTTll ). 

Theorem 2 (Hypergraph regularity lemma) Let fix a constant k > 0. Then for any e > and 

function _F : N — > (0, 1) there exist constants c — c(e, F) and No(e, F) such that if H is a k-uniform 
hypergraph on a set X, \X\ > No^e, F), then there exists an F{l)-equitable l-hyperpartition Tifor 
some 1 < I < c such that 

• Each Pj is F{l)-regular 

• AT| < e('^') where T is the union of some Ti-cells. 

2.3 Combinatorial Structures 

In this subsection we introduce some further definitions about hyperpartitions. Let TL ~ {P^ } be an 
^ -hyperpartition on a set X where \ < j <l and 1 < r < fc. We shall need the notion of a directed 
H-cell. Let / : r([fc]) (-^ [I] be an arbitrary function. Then the directed cell with coordinate / is the 
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set of ordered fc-tuples {xi,X2, ■ ■ ■ , Xk) € such that {xi^,Xi^, . . . , Xi^} G Pr for every set 
S = {ii,i2, ...,ir}& r{[k]). 

The symmetric group Sk is acting on X'^ by permuting the coordinates and this action induces 
an action on the directed W-cells. Note that a W-cell in the non-directed sense is the union of an 
orbit of a directed W-cell under the action of Sk- 

An abstract {k, /)-cell is a function c : r{[k]) i— > [I]. A {k, Z)-cell system C is a subset of all 
possible {k, /)-cells. The symmetric group Sk is acting on r{[k]) and this induces an action on the 
{k, Z)-cells. We say that the system C is symmetric if it is invariant under the action of Sk- Such a 
symmetric (fc, /)-system shall be called a combinatorial structure. 

Thus if W is an Z-hyperpartion on [n] and C is a combinatorial structure then we can define a k- 
uniform hypergraph H{H, C, [n]) the following way. The hypergraph H{H, C, [n]) is the union of 
those W-ceUs in [n] which belong to the coordinates of the combinatorial structure C. If -F is a 
fc-uniform hypergraph then we may compute the homomorphism density of F in a combinatorial 
structure C as follows. Assume that V{F) = [n] and fix a bijection se ■ [k] E for each edge 
of F. A function g : r{[n],k) [I] is called a homomorphism of F into C if for every edge E 
the restriction g o se r[k] — > [Z] is a (fc, /)-cell of C. The homomorphism density t{F, C) is the 
probabihty that a random map / : r([n], k) i— > [Z] is a homomorphism. 

2.4 Regularity Lemma as compactness 

In this section we state a new type of regularity lermna together with a counting lemma which 
implies the one stated in the previous section. An interesting feature of this regularity lermna is 
that arbitrarily decreasing functions (which are common features in "strong" regularity lemmas) are 
replaced by a sequential compactness type statement. 

Theorem 3 (Hypergraph Sequence Regularity Lemma) For every e > and k-uniform increas- 
ing hypergraph sequence {Hi}°°^^ there is a natural number I — l{e, {Hi}°^i) such that there is a 
subsequence {Hl}°l^ o/{-ffj}^]^ together with a sequence of l-hyperpartitions {'Hi}'^i satisfying 
the following properties 

1. For every i there is Ti which is the union of some Hi-cells such that \HlATi\ < e('^'') where 
T is the union of some Hi-cells and Xi is the vertex set ofH[. 

2. The hyperpartition Hi is Si-equitable and 6i-regular where limj^oo Si = 0. 

3. Every Ti has the same combinatorial structure C 

4. limj_>oo t{F, Ti) = t{F, C)for every k-uniform hypergraph F. 

Note that the value of / depends on the concrete sequence {Hi}°^^. To see this one can take a 
large random graph G on n vertices and then take the i-fold equitable blowups Gi of G. The reader 
can check that in this case (with high probability) / = n for any e < 1/2. 
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It is quite natural to interpret Theorem[3]in terms of compactness. 

Definition 2.6 An increasing hypergmph sequence {Hi}'^^ is called strongly convergent if for 

every e > there is a number I, hypergraphs Ti on the vertex sets of Hi and l-hyperpartitions 
Tii on Xi for every i such that 

1. Ti is the union of some Tii-cells 

2. |i/,AT,| <e(l^'l) 

3. The hyperpartition Tii is 5i regular and Si equitable where limi^oo — 0. 

4. Every Ti has the same combinatorial structure. 

Using this definition the sequence regularity lemma gets the following simple form: 

Theorem 4 (Regularity as Compactness) Every hypergraph sequence has a strongly convergent 
subsequence. 

2.5 Euclidean hypergraphs 

The goal of this subsection is to generalize the notion of /c-uniform hypergraphs and homomor- 
phism densities to the Euclidean setting in order to define limit objects for convergent sequences 
of finite hypergraphs. Seemingly, the appropriate Euclidean analogue of fc-uniform hypergraphs 
would be just the -invariant measurable subsets of [0, 1]''. One could easily define the notion of 
homomorphisms from finite fc-hypergraphs to such Euclidean hypergraphs and even the associated 
homomorphism densities. The problem with this simple notion of Euclidean hypergraphs is that 
they could serve as limit objects only for very special finite hypergraph sequences. In order to con- 
struct (see Example 1.) limit objects to the various random construction of convergent hypergraph 
sequences one needs a little bit more complicated notion. 

Let A; > and consider [0, 1]^*^^^ = [0, that is the set of points in the form 

), where Ai, ^2, • • ■ ,^2''-i is a list of the non-empty subsets of [k]. Observe 
that the symmetry group Sk acts on [0, by 

We call a measurable S — /c-invariant subset TC C [0,1]^ ~^ a fc-uniform Euclidean hypergraph. 
Now let K he a finite fc-uniform hypergraph and let T^{K) C r{[n] , fc) be the simplicial complex of 
K consisting of the non-empty subsets of the fc-edges of K. Let Ci, C2, . . . , C^^^k)^ be a list of the 
elements of T^{K). 

Definition 2.7 (Euclidean hypergraph homomorphism) A map g : r{[n], fc) —> [0, 1] is called a 
Euclidean hypergraph homomorphism from K to Ti. if for any edge E G E{K): 

{g{sE{Ai)),g{sE{A2)), . . ■ , 5(sij(^2^-i))) e 
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where se [k] ^ E is a fixed bijection. The induced Euclidean hypergraph homomorphism is 
defined accordingly. 

Note that the notion of hypergraph homomorphism does not depend on the choice of se- Thus the 
Euclidean hypergraph homomorphism set T{K, H) C [0, 1] ' (["l '^) is the set of points 
(2/Bi,2/B2: • ■ • .2/B|,.(H,fc)|) such that the map g :^ [0,1], ^(Bi) = is a homomorphism. One 
can similarly define the Euclidean hypergraph induced homomorphism set. We call X{T{K, H)) 
the I S(isr) [-dimensional Lebesgue-measure of the homomorphism set the homomorphism density. 
We say that the hypergraph H is the limit of the fc-uniform hypergraphs {-ffnjj^i if 

lim t{K,Hn) = X{T{K,n)) 

for any finite fc-uniform hypergraph K. 

Example 1. There are many ways to define random /c-uniform hypergraph sequences. The most 
natural one is the random sequence {Hn}^^i, where each edge of the complete hypergraph on n- 
vertices is chosen with probability i to be an edge of i/„. Thus for any fc-uniform hypergraph K, 
lim„^oo t{K, Hn) = (5)'^^^-" with probabihty 1. Let us consider the hypergraph 

An easy calculation shows that \{T{K, Ti,)) = (^)''^*-^'" that is Ti. is the limit of a random hyper- 
graph sequence {Hn}^^i with probability 1. 

Example 2. Now we consider a different notion of randonmess. Let the random sequence {il^ln^i 
be constructed the following way. First choose each (fc — l)-subset of [n] randomly with probabihty 
i. Then E will be an edge of H'^ if all its {k — l)-dimensional hyperedges are chosen. Clearly, 
lim„_»oo t{K, H'^) = (5)'^'*=-^ with probability 1, where |-fl'|fe-i is the number of 
{k — 1) -hyperedges in ^{K). Now we consider the hypergraph 

n' = {{xAi,XA2,---,XA^k_J e [0, l]^*""^ I < a;i,2,3,...,fe-i < ^ ,0 < 2:1,2,3,. ..,fe-2,fe < ^, 

...,0< 0:2,3,. ..,fe < 

Then X{T{K, W')) = (i) I^U-i . Thus H' is the limit of a random hypergraph sequence {H^}^^i 
with probabihty 1. 

Now let K he a finite fc-uniform hypergraph. For any E € E{K) we fix a bijection se '■ [k] ^ E 
as above. Let L,^ : [0, ^ [0, Ij'-d^D, 

Lsb{xAi,XA2j ■ ■ ■ ,Xa^,,_^) = {Xse{Ai),Xsj^(A2)j ■ ■ ■ jXsB{A^k_i)) 

be the natural measurable isomorphism associated to the map se- Also, let Le '■ [0, !]'"(["■!''=) — > 
[0, be the natural projection. Then for a fc-uniform Euclidean hypergraph H and a finite 
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fc-uniform hypergraph K onn vertices 

T{K,n)^ fl L^\L,,{n)). (1) 

Also, 

T,nd{K,n)^ fl L^i(L,,(H))n fl L^i(i,^,(H^)). (2) 

E£E{K) E'^E(KY 

We formulate ([T]l in an integral form as well. Let : [0, l]''^!''!) ^ {0, 1} be the characteristic 
function of the Euclidean hypergraph Ti. We call such an object a hypergraphon. Then 



X{T{K,n))^ f [ ... f f II ■^'e] dxc.dx. 

Jo Jo Jo \E^EiK) J 



where ^'_b is the characteristic function of Clearly, 
Thus, we have the integral formula 



t t - f i n ^ni^ 

Jo Jo Jo \EeE{K) 



X{T{K,n)) = 



Remark: One can introduce the notion of a projected hypergraphon which is the projection 
of a hypergraphon to the first 2''" — 2 coordinates, where the last coordinate is associated to [k] itself. 
That is 

Wj^{xA,,XA:,,...,XA^^^J ^ / Wj^{xA,,XA2,---,XA^^_JdxA^^_^. 

Jo 

That is Wj^ is a [0, 1] -valued function which is symmetric under the induced 5fe-action of its coor- 
dinates. By the classical Fubini-theorem we obtain that if 7i is the limit of the hypergraphs {iJi}^^ 
then 

limi^aot{K, Hi) = 

= f f ■■■ f n ^H('''^B{A,),Xsj,(A2),---,x,^^A,,_^))dxc,dxc2---dxc^^^^_^, 

Jo Jo Jo EeE{K) 

where the integration is over the variables associated to the simplices of dimension less than k. Note 
that in the case fc 2 it is just the graph limit formula of ||9J. 

Note that for a combinatorial structure C one can define a hypergraphon Wc ^ [0,1]^ Recall 
that an /-box Z in [0, 1]^''^^ is a product set in the form 

X — , — - — X ... X 



r I J \r I J \ I I 

The map f : r[k] ^ [I], defined by f{Aj) = ij is the coordinate function of the box Z. Then Wc is 
the union of the boxes corresponding to the coordinates of the combinatorial structure C. It is easy 
to check that t{F, C) — t{F, Wc) for any fc-uniform hypergraph F. 
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2.6 W^-random graphs and Sampling 

Let us consider the following natural sampling process for fc-uniform hypergraphs. We pick n 
vertices vi,V2, ■ ■ ■ ,Vn independently and uniformly at random from the vertex set X of H and then 
we create a hypergraph G{H, n) with vertex set [n] such that {ii, 12, . . . , ik} is an edge in G{H, n) 
if and only if {vi-^ , , . . . , Wi^. } is an edge in H. Thus Gr{H, n) is a hypergraph valued random 
variable. The distribution of n) can be described in terms of the homomorphism densities 
tmd[F, H) where 1^(^)1 < n. The probability that we see a fixed hypergraph F on [n] in G{H, n) 
is equal to tiad{F, H). 

Now we generalize sampling for Euclidean hypergraphs W C [0, Let us introduce a 
random variable Xs for every set S E r{[n], k) which are independent and have uniform distribution 
in [0,1]. Then {^1,12, . . . ,ife} is an edge in G(M^,n) if • ■ • = 1 where 

Ai,A2, . . . , ^2*=-! the non empty subsets of {ii, 12, ■ • ■ , ik}- This again gives a hypergraph 
valued random variable on [n] which is the infinite analogy of the finite setting. 

Another important sampling process from W wiU be called the hyperpartition sampling. As- 
sume that H = {Pj}i<j<i,i<r<k is an Z-hyperpartition on the set [n]. We consider the function 
g : r([n\, k) [I], which is equal to j if and only if 5* £ -Pj^'- Now we define a sampling process 
G(W, H, n) in the same way as G(W, n) with the extra restriction that Xs has uniform distribution 
in the interval [{g{S) — g{S)/l). This sampling process has the property that tiad{F, W) = 
implies UndiF, G{W, H, n)) = with probability 1. 

Finally, we introduce the notion of random coordinate systems. Let Z„ be the random variable 
which is a random point in [0, IJ^'d"] '^) with uniform distribution. In other words Z„ is a r([rt], k)- 
tuple of independent random variables with uniform distribution {/T}Ter([n].fc)- L^t [njg be the 
set of elements in [nY without having repetitions in their coordinates. We introduce the random 
variables r" : [n]^ ^ [0, such that the component t^(xi, X2, . . . , Xk) corresponding to an 
element {ii,i2, ••■,«*} — S G r{[k]) is equal to the value of fxi-^,xi2,...,xi^- We call the random 
variables t" random coordinate systems corresponding to [n]. An important property of (r") is 
that for a measurable set W C [0, the distribution of the random hypergraph-valued function 
(r")^^(VF) is exactly the same as the distribution of G{W, [n]). 

2.7 Ultraproducts of finite sets 

First we recall the ultraproduct construction of finite probability measure spaces (see fSl). Let 
{Xi}^i be finite sets. We always suppose that \Xi\ < \X2\ < {X^l < . . . Let uihe a nonprincipal 
ultrafilter and lim^^ : Z°°(N) ^ M be the corresponding ultralimit. Recall that limc^ is a bounded 
linear functional such that for any e > and {a„}5^j^ G 1°"{N) 

{i G N I Oi G [lima„ — e, lima„ + e]} G w . 
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The ultraproduct of the sets Xi is defined as follows. 

Let X = rii^i ^i- We say that p = {p,} -5,1, q = {qi}iZi e X are equivalent, p ~ g", if 

{i eN\pi = q,} eoj. 

Define X := X/ ~. Now let V{Xi) denote the Boolean-algebra of subsets of Xi, with the nor- 
malized measure iJ.i{A) = ^ . Then let V = H^^i 'Pi^i) and P = V/I, where I is the ideal of 
elements {^O^i ^^^^ that {i e N | A, = 0} e a; . Notice that the elements of V can be identified 
with certain subsets of X: If 

p = G X and A = [{Ai}r=,] e V 

then peAif{ieN\pieAi}Guj. Clearly, if A = [{Ai}^^], B = [{BJ^J then 

. A' = [{Ami], 

• AuB=[{AiUBi}?l^], 
. AnB=[{AinBi}91,]. 

That is "P is a Boolean algebra on X. Now let ii{A) — linio, ^J.i{Ai). Then : P ^ M is a finitely 
additive probability measure. We will call A = [{Ai}°l^] the ultraproduct of the sets {Ai}°l-^. 

Definition 2.8 C X is a nullset if for any e > there exists a set A^ G V such that iV C A<; and 
f^{Ae) < e. The set ofnullsets is denoted by N. 

Proposition 2.1 M satisfies the following properties: 

• ifN e M and M C N, then M e Af. 

• Jf{Nk}kLi are elements ofAf then U^^A^fe & Af as well. 

Proof. The first part is obvious, for the second part we need the following lemma. 

Lemma 2.4 ^{Afe}^i are elements ofV and lim;_>oo /u(Uj.^jj4fe) = t then there exists an ele- 
ment B £V such that n{B) = t and Ak C Bfor all fc e N. 

Proof Let Bi = U^^^Afe, = ti, lim;^oo ti = t . Let 

T; = |iGN I |/Xi(uU4)-id<^} . 

where = [{^USi] • Observe that G cj. If i S n^^Ti but i ^ T„+i, then let C\ = U^^^ A*. . 
IfieTi for all / G N, then clearly /i«(U^iA*,) = t and we set Ci := U^^A*. . Let B := 
[{Ci}^^] . Then ii(B) = t and for any k e N:A^ CB. I 

Now suppose that for any j > 1,A] G Af. Let B^j G V such that A] CB^j and ij,(B'j) < e—. 
Then by the previous lemma, there exists B'^ gV such that for any j > 1 -B^ C and p.{B^) < e. 
Since UjliAj C B^, our proposition follows. I 
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Definition 2.9 'We call B CX-a measureable set if there exists B £V such that BAB G Af. 

Proposition 2.2 The measurable sets form a a-algebra and i-i{B) = ^j.{B) defines a probability 
measure on B^. 

Proof. We call two measurable sets B and B' equivalent, B = B' if BAB' G M. Clearly, if 
A = A', B = B' then A" = {A'f, A\J B = A' \J B' , Ar\ B = A' r\ B' . Also if A, S G 
and A^ B, then fJ.{A) = fJ,{B). That is the measurable sets form a Boolean algebra with a finitely 
additive measure. Hence it is enough to prove that if Ak G P are disjoint sets, then there exists 
A e P such that U^^A^ = A and fi(A) = J^kLi m(^) • Note that by Lemma l24l there exists 
AeP such that iJ.(A) = Y^kLi and 31 C for all fc > 1. Then for any j > 1, 



Hence we constructed an atomless probability measure space (X, B^^, fi). Note that this space is 
non-separable, that is it is not measurably equivalent to the interval with the Lebesgue measure. 

2.8 a-algebras and the Total Independence Theorem 

We fix a natural number k and we denote by [k] the set {1,2,..., fc}. Let X^.i, 2, • • ■ , Xi^k be 
fc copies of the finite set Xi and for a subset A C {1, 2, . . . , fc} let Xi^a denote the direct product 
@j^A -^iJ- Let denote the ultraproduct of the sets Xi^A, with a Boolean algebra Va- There is 
a natural map pA ■ Xl''"! ^ X"^ (the projection). Let Ba be the cr-algebra of measurable subsets in 
X"* as defined in the previous sections. Define a{A) as p^^{Ba), the cr-algebra of measurable sets 
depending only on the A-coordinates together with the probability measure pA- For a nonempty 
subset A C [fc] let A* denote the set system {B\B C A , \B\ ^ \A\ - 1} and let cr(A)* denote 
the cr-algebra {a{B)\B G A*). An interesting fact is (as it will turn out in Section|6]l that (j{A)* is 
strictly smaller than (J (A). The following figure shows how the lattice of the various cj-algebras look 
like. Recall that if B C A are cr-algebras on X with a measure /i and g is an ^-measurable function 
on X, then E{g \ B) is the ,B-measurable function (unique up to a zero measure perturbation) with 
the property that 



for any Y B (see Appendix). If A G ^ we say that A is independent from the u-algebra B if 
E{xA I B) is a constant function. One of the main tool in our paper (the proof will be given in 
Section|5]) is the following theorem. 



A\uT=i A.,CA\ ui^, A,eP. 



Since lim^-^oo U^=i ^fc) = 0, A\ At e N thus U^^Afc = A I 




16 




Figure 1 : The cr-algebras 

Theorem 5 (The Total Independence Theorem) Let Ai, A2, . . . Ar be a list of distinct nonempty 
subsets of [k], and let Si, S2, ■ ■ ■ , Sr be subsets q/X^'^l such that Si G <7{Ai) and E{Si\(7{Ai)*) is 
a constant function for every I < i < r. Then 

tx{Si n 52 n . . . n 5,) = ^^(51)^(^2) ■ • ■ KSr). 

3 Correspondance Principles and the proofs of the Removal and 
Regularity Lemmas 

3.1 The ultraproduct method and the correspondence principles 

The ultraproduct method for hypergraphs rehes on various correspondence principles between the 
following objects that are infinite variations of the concept of a fc-uniform hypergraph. 

1 . An infinite sequence of hypergraphs Hi, H2, ■ ■ ■ in TCk ■ 

2. The ultraproduct hypergraph H. 

3. A fc-uniform Euclidean hypergraph 7i C [0,1]^ 

Additionally we will need correspondence principles between homomorphism sets 

{T{K,H,)}°Z, , T{K,Il) and T{K,H) 

for every fixed fc-uniform hypergraph K. Let {Hi C be a sequence of finite fc-uniform 

hypergraphs. Then the ultraproduct hypergraph H = [{i/i}^^] c X*"' is well-defined. Clearly, 
H is S'fc-invariant and has no repetitions in its coordinates. One can formally define the homomor- 
phism set T{K, H) for any finite fc-uniform hypergraph K exactly as in Subsection 12.11 Note that 
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we shall refer to any measurable S'fc-invariant set P C X'^ without repetitions in its coordinated a 
/c-uniform hypergraph on X. 

The following lemma is a trivial consequence of the basic properties of the ultraproduct sets. 

Lemma 3.1 (Homomorphism correspondence I.) The homomorphism set T{F, H) is the ultra- 
product of the homomorphism sets T(F, Hi). The induced homomorphism set Tind{F, H) is the 
ultraproduct of the homomorphism sets Tind{F, Hi). 

To state the next theorem we need some notation. For an arbitrary set S let r{S, m) denote the 
set of non-empty subsets of S of size at most m and let r{S) denote r(5, IS*!). The symmetric group 
Sn is acting on [n] and this action induces an action on r([n],m). Furthermore Sn is acting on 
[0, by permuting the coordinates according to the action on r{[n\, m). Let X, G, G2 be 
sets such that G2 Q G. Then we will denote the projection X'^ ^ X'^^ by Pq^. If a function / 
takes values in X^ then for an element a ^ G we denote the corresponding coordinate function by 
fa which is the same as the composition P^a} ° f ■ 

Definition 3.1 (Separable ReaUzation) For any k N a separable realization is a measure pre- 
serving map (f> : X*^ 1-^ [0, such that 

1. Any permutation n ^ Sk commutes with (j) in the sense that (j){'X-)^ — 4>{yi'^). 

2. For any D G [fc] ) and measurable set A C [0, 1] the set (f)^ [A) is in a{D) and is indepen- 
dent from a{D)*. 

Note that the fact that </> commutes with the S'^-action means that 0^(x'^) = (x) for each 

TT G Sk- The second condition in the previous definition expresses the fact that the functions 
of a separable realization depend only on the D-coordinates. Also, by Lemma lT2] of the Appendix 
and the Total Independence Theorem a separable realization cj) gives a parametrization of X'^ by 
|r([fc])| coordinates such a way that (f>^^ defines an injective measure algebra homomorphism from 
7\/(([0, A'') to a subalgebraof A^(X'=,i3[fe],Ai[fc]). The next theorem is the heart of the 
hypergraph ultraproduct method. The proof of it will be discussed in Section |6] 

Theorem 6 (EucUdean correspondence) Let Abe a separable sub-a-algebra ofa^k] on X'^. Then 
there is a separable realization (j) : X''' [0, such that for every A ^ A there is a measurable 



Corollary 3.1 Let E be an Su-invariant measurable subset ofHJ^. Then there is a separable real- 
ization (j) and Sk-invariant measurable set W C [0, l]'"^!'^]) such that /i((/i^^(W^)AE) = 0. 

The following definition and lemma will be needed to state the main correspondence between 
homomorphism sets. 




1(B) AA) =0. 
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Deiuiition 3.2 (Lifting) Let </> : X*' i-> [0, be a separable realization and let n > k be an 
arbitrary natural number Then a measure preserving map tp : X" i— > [0, 1]''(W>'=) is called a degree 
n lifting of(j) if Pr{[k]) ° equal to cp o P^^ on X" and tp{x)'^ = tp{x'^)for all permutations 

IT G Sn. 

Lemma 3.2 (Lifting exists) Let cp : X''^ ^ [0. l]''*!*"!) be a separable realization and let n > k be 
an arbitrary natural number. Then there exists a degree n lifting ip of (j). 

Proof Let A e r{[n], k) be an arbitrary set with t elements and let tt G be a permutation 
such that = [t]. We define V'a(x) to be 0[t](P[/j](x'^)). Using the fact that <f) commutes with 
the Sk action we obtain that (pA o P[k] = V'a for every A e Now if 7r2 is an arbitrary 

permutation from 5„ then the A-coordinate of ip{-x)^^ is the A^^ -coordinate of V'(x) which is 
the j4-coordinate of V'(x'^^)- This proves that ip cormnutes with Sn- It remains to show that ij) is 
measure preserving. The coordinate functions ipA are constructed in a way which guarantees that 
they are measure preserving. Let I a C [0, 1] be intervals of length I a for every A e r([n], fc) and 
let 

W= X{ Ia 

Aer{[n],k) 

be their direct product. Since every measurable set in [0, can be approximated by the 
disjoint union of such cubes it is enough to check that '0^^ preserves the measure of such a set W . 
The preimage tjj^'^iW) is the intersection of the preimages iIj^{Ia) which are in (j{A) and are 
independent from (t{A)* . Now the Total Independence Theorem completes the proof. I 

Lenuna 3.3 (Homomorphism Correspondence n.) Let W C [0, be an Sk-invariant mea- 
surable set and let E be the preimage of W under some separable realization (j). Then for an 
arbitrary finite hypergraph K 

i,-\T{K,W))=T{K,E), 
where ip is a \K\ degree lifting ofcj). Similarly, 

Proof Assume that the vertex set of K is defined on [n] and that the edges of K are 
7ri([fc]), 7r2([fc]), . . . , 7rt([fc]) for some permutations tti, 772, . . . , TTt in Sn- Let E2 C X" be the 
preimage of E under the projection P[fc] and let W2 C [0, 1 ^ be the preimage of W under the 
projection Pr([k\)- By definition we have that 

t 

r(if,E) = f|Ej' 

i=l 

and 

t 

T{K,W) = {^W^\ 

i=l 
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Since tp is a lifting of (f> the first lifting property shows that t/j~^{W2) = E2. Furthermore since ijj 
commutes with the elements of Sn we get that — Ef for every vr G Sn- This completes 

the proof. I 

3.2 The proof of the Hypergraph Removal lemma 

Lemma 3.4 (Infinite removal lemma) Let H be the ultrapwduct of the k-uniform hypergraphs 
Hi, H2, ■ ■ ■ and let F be a finite k-uniform hypergraph such that T{F, H) has measure 0. Then 
there is a 0-measure Sk-invariant subset I o/H such that T{F, H \ I) is empty. 

Proof. We use Corollarv l3.1l for the set H and we get a separable reaUzation ip and a measurable set 
W C [0, satisfying the statement of the corollary. Let D denote the set density points in W. 
Lebesgue's density theorem says that W\D has measure 0. Furthermore D will remain symmetric 
under the action of the symmetric group on [0, Let D be the preimage of D under the map (f>. 
Using the first property in definition l3.1l we obtain that D is 5*^- invariant. Furthermore the measure 
of HAD is 0. 

Now let F be a fc-uniform hypergraph on the vertex set [n] and let iphe a degree n lifting of </). 
Lemma|33]shows that T{F, D) is the preimage of T{F, D) under ip ^. On the other hand T{F, D) 
is the intersection of finitely many sets consisting only of density points. This show that T{F, D) 
and thus T{F, D) is either empty or has positive measure. This means that the set I = H \ D 
satisfies the required condition. I 

Proof of the hypergraph removal lemma. We proceed by contradiction. Let K a fixed hy- 
pergraph and e > be a fixed number for which the theorem fails. This means that there is a 
sequence of hypergraphs Hi on the sets Xi such that limi^oot{K, Hi) — Q but in each Hi there 
is no set L with the required property. Again let H C X'' denote the ultraproduct hypergraph. 
Then p{T{K, H)) = lim^^ t{K, Hi) — and thus by the previous lemma there is a zero measure 
S**: -invariant set I C X*^ such that T{K, H \ I) = 0. By the definition of nullsets, for any ei > 
there exists an ultralimit set J C X*^ such that I C J and p{3) < ei. We can suppose that J is Sk- 
invariant as well. Let [{ Jij^^] = J, then for w-almost all i, Ji is Sk- invariant, \ Ji\ < ei \Xi\'' and 
T{K, Hi\Li) = 0, where Li is the set of edges {xi,X2, ■ ■ ■ , Xk} such that {xi,X2, ■ ■ ■ , Xk) S Ji. 
Clearly, \Li\ < \ Ji\, hence if ei is small enough then \Li\ < e('"^'') leading to a contradiction. 

3.3 The existence of the Hypergraph Limit Object 

Proposition 3.1 Let {Hi}°^i be a sequence of k-uniform hypergraphs and let H be their ultra- 
product hypergraph. Assume furthermore that (j) : X*^ > [0, is a separable realization such 
that there is an Sk-invariant measurable set H C [0, with p{(j)^^ {'H)A'H.) — 0. Then for 
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every k-unifonn hypergraph K we have that 

lim t{K,Hi) = t{K,n). 

Proof. Let K hea k uniform hypergraph on n vertices and let iphe a degree n lifting of cf). Lemma 
13.1 [ implies that t{K, H) lim^^ T{K, Hi) furthermore, using that is measure preserving, lemma 
[33]impUes that t{K, H) = t{K, H). I 

The following theorem is an immediate corollary of the previous one. 

Theorem 7 (Existence of the Umit object) If {Hi]°^^ is a convergent sequence of k-uniform hy- 
pergraphs then there exists a Euclidean hypergraph Ti. C [0, such that limi^aot{K, Hi) = 
t{K, TL) for every k-uniform hypergraph K. 

3.4 The proof of the Hypergraph Regularity Lemma 

Suppose that the theorem does not hold for some e > and F : N (0,1). That is there exists 
a sequence of fc-uniform hypergraphs Hi without having F(j)-equitable j-hyperpartitions for any 
1 < i < * satisfying the conditions of our theorem. Let us consider their ultraproduct H C X*'. 
Similarly to the proof of the Removal Lemma we formulate an infinite version of the Regularity 
Lemma as well. 

Let Kriyi) denote the complete r-uniform hypergraph on X, that is the set of points 
{xitX2, ■ ■ ■ ,Xr) e such that Xi ^ xj if i ^ j. Clearly Kr(X.) C X*" is measurable and 
^[r] {Kr{^)) = 1 . An r-uniform hypergraph on X is an S'^-invariant measurable subset of Kr(X.). 
An Z-hyperpartition7i is a family of partitions KrCX.) = U^^j^pj, where pj is an r-uniform hyper- 
graph for 1 < r < fc. Again, an /-hyperpartition induces a partition of Kk (X) into 7i-cells exactly 
the same way as in the finite case. It is easy to see that each 7i-cell is measurable. 

Proposition 3.2 (Hypergraph Regularity Lemma, infinite version) For any e > 0, there exists a 
Q-equitable l-hyperpartition (where I depends on Hj Ti. such that 

• Each PJ is independent from cr([r])*. 

• (HAT) < e, where T is a union of some Ti.-cells. 

Proof. Let be a separable realization for H that is such a 4> that there exists an S'fc -invariant subset 
Q ^ [0,1]^ '"^ such that (0-1 (<3) AH) = 0. Since Q is a Lebesgue-measurable set, there exists 
some / > such that Vol2k_i{QAZ) < e, where Z is a union of Z-boxes (see Subsection l2.3b . 

By the usual symmetrization argument we may suppose that the set Z is invariant under the 
Sfc-action on the Z-boxes. For each 1 < r < fc we consider the partition X*" = uj^j^pj, where 
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pj = (f)'^^^ {'^'Y- , |) . We call the resulting Z-hyperpartition H. Note that by the S'^-invariance of the 
separable realization each pj is an r-uniform hypergraph and also pj is independent from ct ( [r] ) * . 

Now we show that C is an 7^-cell if and only if C = (/)^^(U7rg5^7r(Z?)), where D is an /-box in 
[0, 1]^''^^. By definition a = (ai, 02, ... , afc) e X'^ and b — {bi, 62, ... , bk) £ X'^' are in the same 
W-cell if and only if there exists n & Sk such that (oi^, 0^3, . . . , ^| ) and (^i^^, , bi^^2) ■ ■ • > ^i^{\A\)) 
are in the same P^ for any AC [k]. That is ip{a) and (f>{h'^) = (0(b) )^ are in the same Z-box. 

Since Z is a union of 5fc-orbits of /-boxes the set T = (p^^{Z) is the union of Ti, cells. Using that (p 
is measure preserving the proof is complete. I 

Now we return to the proof of the Hypergraph Regularity Lemma. First pick an r-hypergraph PJ on 
X such that /X[^](pj APJ ) = 0, PJ e Vir] and U^^iPJ i^,.(X). Let [{P^^jtZi] = H ■ Then for 
w-abnost all indices Uj^^P^^ — Kr{Xi) is an P(/) -equitable /-partition and \HiA < 
e('^') for the induced 7i-cell approximation. Here U^j^j^Cm is the 7i-cell approximation with 
respect to the /-hyperpartitions u'^^pj i^r(X) and [{C^j},^i] = Cm. 

The only thing remained to be proved is that for w-almost all indices i the resulting /-hyperpartitions 
are F(/)-regular. If it does not hold then there exists 1 < r < fc and 1 < j < / such that for almost 
all i there exists a cylinder intersection Wi C Kr{Xi), \Wi\ > F{l)\Kr{Xi)\, such that 



> F{1) . (3) 



\Kr{X,)\ |Wi| 

Let W — [{Wijj^i] . Then W e a{[r])*. Hence pj and W are independent sets. However, by Q 

/XH(PiVH(W)^A*M(Pinw), 

leading to a contradiction. I 

3.5 The proof of the Hypergraph Sequence Regularity Lemma 

Let us consider the ultraUmit H of the hypergraph sequence {Hi}^^^ as in the proof of the regu- 
larity lemma together with the /-hyperpartition H given by the partition U^^^pj — /^^(X), where 
[{^r = P^ . If s > 1, then for tj-almost all indices 

• u'^j^P/j = Kr{Xi) is an i-equitable i -regular partition 
. |i/.AU^,^iC;„|<e(l^'l). 

• Ti has combinatorial structure C, where Ti — Uf^^j^C^. 
Also, by Lemma lTTI and Lemma [33] 

limt(P,TO = t(P, (U?„^iCm)) = t(P,C) . 

Thus for a;-almost alH, \t{F,Ti) — t{F,C)\ < ^ . Therefore we can pick a subsequence i/- satisfying 
the four conditions of the Hypergraph Sequence Regularity Lemma. I 



22 



3.6 Testability of Hereditary Properties 

We omit here the definition of Property Testing but we state a theorem which is equivalent with the 
statement that hereditary hypergraph properties are testable. 

Theorem 8 Let J- be a family of k-uniform hypergraphs. Then for every e > there is a S = 
(5(e, J-) > and a natural number n = n{e, J-) such that if H satisfies tind{F, H) < S for every 
F £ J- with V{F) < n then there is a hypergraph H' on the vertex set X of H with \HAH'\ < 
e(l^l) such that t^^^iF, H') = 0/or every F £T. {see also SH^M and 1^) 

Proof. We proceed by contradiction. Assume that there is a sequence {Hi]°^^ and e > such that 
limi^oo tmd{F, Hi) = for every F E T, however no member of the sequence can be modified in 
the way guaranteed by the theorem. Let us repeat the construction used in the proof of the Regularity 
Lemma again. Let H be the ultralimit hypergraph of We use Corollary |TT| for the set H 

in order to obtain a separable realization (j) and a measurable set W C [0, IJ^'d'^l) satisfying the 
statement of the corollary. Then tind{F, W) — limi^ tind{F, Hi) = for every F <E !F. 
Thus there is an Z-step Euclidean hypergraph (a union of /-boxes) W' such that Vol{WAW') < e/4. 
Let Q be the preimage of W' under cf). Denote by C the combinatorial structure of W' . As in the 
proof of the regularity lemma for each 1 < r < fc we consider the partition X*" = U^^j^pj , where 
pj — f ) • We call the resulting /-hyperpartition H. The set Q is the union of some cells 

in H. Again, we modify the sets pj to obtain the sets [{i^i? = Pi • Consider the resulting 
/-hyperpartitions Hi on Xi and for every i denote the union of Tii-cells with coordinates in C by 
Qi. That is Qi = G{W' , Hi, [ui]), where [n^] is the vertex set of H^ . Note that G{W' , Hi, [n^]) is 
a random hypergraph, nevertheless it always takes the same value. Then of course, /i(Q' AQ) = 
where Q' is the ultralimit of the hypergraphs {Qi}°Zi- 

Now we consider the random hypergraph model Gi — Gr{W,Hi, [rii]). For an ordered set S — 
{ii,i2, ■ ■ ■ jik) S [rii]'^ let denote the random variable which takes 1 if 5* is in GiAQi and takes 
elsewhere. One can easily see that the expected value of Ys is P'^^^Yol{{W' fl B)A{W fl B)) 
where B is the box representing the coordinate of the directed cell containing S. This shows that 

Ei\G,AQ,\) = J2 ^(^s) = J2 ICf'^'yoliiW n BiC))A(W n B{C))) , 
s c 

where C runs through the directed cells of Hi and B{C) is the box in [0, 1]^*^^^ corresponding to 
the coordinate of C. 

Observe that lim^^ \C( \/ {rii)^ — P where C/ is the cell in Hi corresponding to the coordinate 
/ . Indeed, the ultralimit of {C/ is a cell in the /-hyperpartition of X. That is 

lim ^^'^'f^''^ = YoliW nW)< e/4. 
On the other hand we know that lim^ ^'^'^^'^ = |QAH| < e/4. 
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Consequently, lim^^ g(|G^AH,|) < . Note that by probability 1, tind{F, G,) = for any F e J^. 
That is there exists a hypergraph H- which is a value of the hypergraph valued random variable Gi 
such that 

• tind{F,H'i)=0. 

This leads to a contradiction. I 

4 Uniqueness results and metrics 

4.1 Distances of hypergraphs and hypergraphons 

Let U and W be two measurable sets in [0, The distance di [U, W) is defined as the mea- 
sure of their symmetric difference U AW. Let be a fc uniform hypergraph. It is clear from the 
definitions that 

\t{F, U) - t{F, W)\ < \E{F)\dr{U, W). 

We can also introduce a distance using subhypergraph-densities. 
Let 6 = 6w{U,W) denote the smallest number such that 

\t{F,U) - t{F,W)\ < \E{F)\6, for any J'. 

Clearly, S^{U, W) < di{U, W). It is easy to see that 5^ satisfy the triangle inequality. On the 
other hand is only a pseudometric since (as we will see) there are different sets U and W with 
Sw{U, W) = 0. Our goal is to understand which two functions have distance in the pseudometric 

For every set S <E r{[k]) we denote by As the tj-algebra generated by the projection 
[0, [0, Let Ag denote the cr-algebra generated by all the algebras At where T is a 
proper subset of S. for every S £ r{[k]) We say that a measurable map : [0, l]'"'!'^]) i-^ [0, l]'"'!'^]) 
is structure preserving if 

1 . is measure preserving. 

2. (p-HAs) C As. 

3. The sets (t)g^{I) are independent from A*g for every measurable set I C [0, 1]. 

4. (j) o TT = n o (j) for every permutation in Sk. 

The following lemma shows that structure preserving maps do not change the homomorphism 
densities in hypergraphons. 

Lemma 4.1 For any structure preserving map 4> we have that Sw{U, 4>~^(U)) = 0. 
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Proof. We need to prove that for any finite fc-uniform hypergraph F 

t{F,U)=t{F,r\U)). 

Mimicking the proof of Lemma |3^ we can easily see that there exists a map (f) : [0, 
[0, such that (j) commutes with the S'„-action and 

where is the projection to the [fc]-coordinates. Therefore, we have the following formula for the 
homomorphism sets: 

Hence the lemma follows. I 

Definition 4.1 A structure preserving map ip : [0, t-^ [0, Ij^d'^l) is called a structure pre- 

serving equivalence if there is a structure preserving map (j) such that both ip o (j) and cf) o ip are 
equivalent to the identity map on [0, ( recall that equivalence means that two maps define the 
same measure algebra homomorphism). 

Now we introduce the pseudodistance 5i by the formula 

5i{U, W) = ini di (0-1 ([/) (T^)) , 

where (f) and ij] run through all the structure preserving transformations. We will prove the following 
uniqueness theorem (see ifTOl for the graph case) 

Tlieorem 9 (Uniqueness I.) 6^(17, W) = if and only if there are two structure preserving mea- 
surable maps 4>,4: : [0, l]'"'^!''!' y-f [0, l]'''^'''!^ such that the measure of tj)^^ {U)A'ip~^ {W) is zero. 

Tlieorem 10 (Uniqueness II.) (5^,(C/, W) = if and only ifSi{U, W) 0. 
4.2 Technical Lemmas 

First we prove a simple real analysis lemma. 

Lemma 4.2 Let Y C [0, 1]" be a measurable set independent from the a-algebra An-i generated 
by the projection onto the first {n—l)-coordinates. Then there exist measurable subsets Xk C [0,1]" 
in the form 

Xk = (Al X Bf) U (A^ X U . . . U (A^^ X B,;;J 

such that limfc^oo Vol{XkAY) = 0, where A'l U Af^ U . . . U A'^^ is a measurable partition of 
[0, and X{B'[) = X{B^) = ... = A(B^J = Vol{Xk). Obviously the sets Xk are all 

independent from An~i. 
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Proof. Fix a real number e > 0. Let be a union of Z-boxes in [0, 1]" such that I > and 
Vol{Hl\Y) < . By Fubini's Theorem, for almost all ^; G [0, A(A^) = Vol{Y), where 

= {te [0,1], {z,t)eY}. 

For each /-box T in [0, 1]""^ let 

Ht = {s€[Q,1],Txs£H}. 
Lemma 4.3 The number ofl-boxes in [0, for which \X{Ht) — Vol{Y)\ > ^ is less than 

t in—l 
10 ' 

Proof. By Fubini's Theorem, 

\X{Ht) - Vol{Y)\ < Vol{HAY) . 

T 

Hence the lemma follows. I 

Now the set is constructed the following way. Pick an integer m such that | ^ — Vol{Y) | < ^ • 
If for an Z-box T \\{Ht) - Vol{Y)\ < ^ then add or delete less than Z-boxes of H above T to 
obtain exactly m boxes. On the other hand if \X{Ht) — Vol{Y)\ > jq, then just pick m arbitrary 
boxes above T. Then is in the right form and Vol{X^AY) — > as e ^ 0. I 

The following lemma establishes the functorality of separable realizations and structure preserving 
maps. 

Lemma 4.4 Let (^i : X'^ ^ [0, be a separable realization and p : [0, ^ [0, be 
a structure preserving map. Then po(j)isa separable realization as well. Similarly the compositions 
of two structure preserving maps, or the inverse of a structure preserving equivalence is a structure 
preserving map. 

Proof. For the first part is enough to prove that if M C [0, M & As for some S C [k] such 
that M is independent from .Ag then (^^^(M) is independent from a (5)*. 
First suppose that M is in block-form that is 

M = u7=i(AinBi), 

where for any I < i < n, Bi Bs and Ai e A^g that UjLi^i is a measurable partition of 
[0,l]''(['=l).LetlG£7(5)*.Then 

r\M) n I = u?=i(ri(AO n I) n r\Bi) . 

Hence 

n 

ii{ct>-\M) n I) = ^ iJi{r\Ai) n ' (Si)) . 
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Note that n{4>-^{B,)) = Vol{M) and YJl=i /^((/-"H^O n I) = Therefore (t)-^{M) is inde- 
pendent from <j{S)* . By Lemma l4!2l any set in As which is independent from A*g can be approxi- 
mated by sets in block-form, thus the proof of the first part of our lemma follows. The second part 
can be proved completely similarly. I 

The following lemma is a baby-version of the Total Independence Lemma. 

Lemma 4.5 For any S C [k], let Xs G As such that Xs is independent form A*g. Then {Xs}s<z[k] 
is a totally independent system. 

Proof. We need to prove that for any set-system {SiYi=i ^ii^]) 

r 

Vol{nU,Xs,)^llVol{XsJ (4) 

Let us proceed by induction. Suppose (|4|i holds for a certain Let{Si}l^l ^ r([fc]) be a set-system 
and suppose that Sr+i is not a subset of Sj, for I < j < r. It is enough to see that 

r r+1 

Vol{Xs^^, n fl XsJ = n Vol{Xs,) (5) 

i=l 1=1 

By Lemma l4!2l we may assume that Xsr+i is in the block-form U"^]^(Ai n Bi), where U^^^Ai is a 
partition of [0, such that are in the cr-algebra Cs generated by {As} sc[k], s^Sr+i 

and {BJr^i C Bs- Since n^^^Xs, G Cs, Q follows. I 

We shall need the auxilliary notion of structure preserving measure algebra embeddings. Let £'"([''l) 
denote the measure algebra associated to ([0, B, A). For any S C [k] let Bs be the subal- 
gebra generated by the S'-coordinate, that is for any S C [k], {Bt}tcs are jointly independent 
subalgebras generating As- We say that an injective homomorphism $ : £''([''1) ^ is a 

structure preserving embedding if 

1 . $ is measure preserving. 

2. <P{Bs) C As for any 5* C [k]. 

3. ^{Bs) is independent of A*g. 

4. $o7r = 7ro$ for every permutation in Sk- 

Lemma 4.6 Let $ : £''(['^1) i— > £''(['=1) be a (measure algebra) structure preserving embedding. 
Then $ can be represented (see Lemma [7.71 1 by a structure preserving map tf> : [0, l]''^!'"'^ i— > 
[0,l]K[fc]). 

Proof. Let us consider the map : B[i] A[i]. By the fourth axiom of structure preserving 
embeddings the image of <l>[i] consists of -invariant elements. We claim that we can represent 



27 



by maps : [0, IJ' d'D ^ [0, 1] such that is S[i] invariant for every measurable set 

I ^ [0, 1]. First we represent (f>[j] by a measurable map (p'^^y Now Lemma |678] implies that S[i\ 
acts freely on [0, withe measurable sets Qi, Q2, ■ ■ ■ , Qa- Let G ~ ^iQi- If x e G then 
we define 4>[i\{x) as 4)'[i]{'K{x)) where tt G S\i\ is the unique permutation with ti{x) G Qi. If 
X G [0, \ G the (/)[i](a;) is defined to be 0. For a general set S G r{[k]) with \S\ = i we 

define 4>s{x) to be (7r(a;)) where tt G S'ffej is an arbitrary permutation with Tr{S) — [i]. The 
S'[j]-invariance of guarantees that 4>s', [0, l]''^'^) [0, 1] is well defined and represents the map 
$5 : Bs As- It is easy to see that the map x ser([k])4'S o Ls isa structure preserving map which 
represents I 

Lemma 4.7 Let W C [0, be an l-step hypergmphon and let : [0, ^ [0, be 

a structure preserving map with T — (j)~^{W). Then there is a structure preserving equivalence 
(see Definition \4.1i ip such that WA^p^^(T) has measure 0. 

Proof. Let denote the set - G A[t] for i = 1, 2, . . . , fc. By the definition of 

structure preserving maps the set P/ is independent from A^f^ has measure l/l and is symmetric 
under S\t\- Using Lemma l676l f or every t = 1, 2, . . . , /c we construct a cr-algebra C\t\ ^ A[t\ such 
that 

1. C[f] is an independent complement for Ay^ in A[t] 

2. PI G C[f] for 1 < i < ; 

3. Every set in C[t] is invariant under the symmetric group S[t]. 

In general, for a set 5* G we introduce Cs as 7r(C[|5|]) where tt is an arbitrary permutation 

taking ^[[S*!] to ^5. By the invariance of C[^s\] this is well defined. 

Now the system of cr-algebras {Cs}ser{[k]) satisfies the following properties. 

1. The cr-algebras Cs generate [0, where S runs through the elements in r([fc]). 

2. Cs C As and Cs is independent form A*s- That is by Lemma [43] the algebras Cs are totally 
independent. 

Now let be a measure algebra isomorphism from [0, 1] to C[t] taking [{i — 1) /I, i/l) to P^. Using 
the action we also define maps ps for every S G r{[k]) satisfying tt o ps — ps o tt for every 
TT G S[k]- Since the algebras Cs are totally independent, by Lemma [TJ] the product of the maps 
Ps creates a measure algebra equivalence from [0, Ij^d'"]) to itself which is a structure preserving 
equivalence. 

Lemma 4.8 For every pair U,W C [0, of hypergraphons and e > there is a structure 
preserving equivalence <j) : [0, l]''^^) [0, ll'^*^!) such that di{U, (jj^^iW)) < W) + e. 
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Proof. Let Ti, T2 be a two ?-step hypergraphons with di{Ti, W) <e/^ and di{T2, U) < e/8. We 
know that there are two structure preserving maps V'l and ^2 such that di{^p2^ (U), ^i^{W)) < 
Si{U, W) + e/8. By Lemma |4!7] there are structure preserving equivalences pi and p2 

withdi(pr'(Ti), Vr'(ri)) = Oand di{p2\T2),^2\T2)) = 0. Now 

d,{p^\T,),p2\T2)) < d,{U,W) + e/4. 

By Lemma l4~4l p2 o p~^ is a structure preserving equivalence that takes W into a set whose distance 
from U is at most 5i ([/, W) + e. 

4.3 A concentration result for W^-random graphs 

Theorem 11 (Concentration) Let W C [0, be a hypergraphon. Then 

o 

f rr 

Pr(|io(F,G(M/, [n])) - t{F, 1^)1 > e) < 2exp(- ^^^^^^^ J . 

The proof of the lemma is identical with the proof of Theorem 2.5 in ||9l, that was used for the 
case k ~ 2. For the sake of completeness we repeat the proof. 

Proof. Let us consider the system of random hypergraph models Gi, G2, . . . , G„ such that the 
distribution of Gn is G{W, [n]) and G; is the sub hypergraph in G„ induced by [i]. It is clear that 
the distribution on Gi is the same as G{W, [i]). Let F be a fixed fc-uniform hypergraph on the vertex 
set [r]. For any injective map ^ : [r] [n] we denote by A^jp the event that ■0 is a homomorphism 
from F to G„ . Let 

B,n = ^^^^VPr(A^|G„0. 

-4, 

The sequence Bq, Bi, . . . , Bn is a martingale, where Bq — Pr(A^) — t{F, W) and 

Bn = Pr{Aji, I Gn) is 1 if ?/) is a homomorphism and elsewhere. This implies that i?„ = 

toiF, Gn). Now we have that 

\B.m ~ S„,_i| < (^- 0! y^|p^(^ I _ p^.(^ I G,„_i)|. 

The terms in the sum for which m is in not in the range of if) are and all the other terms are at most 
one. The number of terms of the second type is r and so |i?„j — -Bm_i| < r/n. By applying 

Azuma's inequality we get that 

Pr(|io(i^, Gn) - t{F, W)\>e)^ Pr(|i3„ - Bo\ > e) < 2exp(^-^-^) = 2exp(-^) 
I 

Theorem 12 (Convergence) The sequences to{F, G{W, [n])) and t{F, [n]) converge to 

t{F, W) with probability one as n goes to infinity. 

Proof. The convergence of to l^", ^(W, [n])) follows from LemmafTTIand the Borel-Cantelli lemma 
since for every fixed e > the sum of the right hand side in the inequality is finite. I 
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4.4 Proof of the Uniqueness Theorems 

Let X be the ultra product of the sets [n]. Let Z„ be the random variable which is a random point 
in [0, l]' (N,fc) with uniform distribution as in Section 122] and let {r"} : [n]^ [0, 1]'''!''!) be the 
associated random coordinate systems. The ultraproduct function r — [{t"}^^^] on X*^ will also 
be called random coordinate system 

Lemma 4.9 The random coordinate system r : X'^ [0, is a separable realization with 
probability one. 

Proof. Let / C [0, 1] be a measurable set and S E r{[k]). It is clear from the definition that r^^(/) 
is in cr(S'). We show that (with probability one) Tg^{I) is independent from cr{S)* and has measure 
A(/). 

Let Ia,b be an open interval with rational endpoints a, b. Let denote the ultra product 
[{(^s)^(^i,f))}5^=i]- By Proposition 16. 1 1 (and the remark after the proposition) we have that al- 
most surely la.b has measure b — a and is independent from (t{S)*. Then we have 

Ia+e,6-e C Tg^{Ia,b) C Ia,b 

for every small enough rational number e > 0. Since there are only countable many rational numbers 
this holds simultaneously for every rational number with probability 1. This implies that Tg^{Ia,b) 
has measure b — a and is independent from a{S)* with probability 1. Since ts is measurable and 
measure preserving on rational intervals it has to be measure preserving on Lebesgue sets. By 
approximating an arbitrary measurable sets by unions of disjoint intervals we get the independence 
fromcr(S')*. 

Now let B C [0, be a box of the form nser([fc]) -^s where Is is an interval with rational 
endpoints. The measure of B is equal to nser([fe]) ^(^s)- The set t^^{B) is equal to 

^Ser([k])Ts^{Is) ■ 

Therefore using the total independence theorem we obtain that with probability one t^^{B) = 
X{B). Again this holds simultaneously for every rational interval system with probability 1. As a 
consequence r is almost surely a measure preserving map. 

The symmetry on r under Sk is clear from its definition. I 

Lemma 4.10 Let W be a hypergraphon. Then with probability one the ultraproduct 

H = C X'' has a separable realization : X'' i-^ [0, l]'"'^'''!) such that 

HA(/)^^(VF) has measure 0. 

Proof. We will use that the set H = can be written as the ultraproduct 

[{(t")^^(T/F)}5^]^]. Our goal is to prove that almost surely HAr~^(M^) has measure 0. First 
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by applying Theorem[T2]to a single hyperedge F we deduce that H has measure X{W) with proba- 
bility one. 

If W is open then t~^{W) is contained in H and, by Lenima |4!9l has measure \ W\ with proba- 
bility 1. This means that with probability 1 the set r^^(W^)AH has measure 0. 

For an arbitrary measurable set W C [0, IJ'^d'^D and e > there are open sets Oi and O2 in 
[0, such that Oi\ O2 ^ W C Oi md \02\ < e. We have that 

(r")-i(Oi) \ (r")-i(02) C ir'^r'iW) C (r")-i(Oi) 

and thus by taking the ultra product 

[{(r")-^(Oi)},T.i] \ [{(r")-^(02)},T=i] C H C [{{T-y^O,)}^^,}]. 

Using our observation about open sets and that t is measure preserving with probability 1 we obtain 
that the measure of HAr^^(M^) is at most e. By Lemma |4~9] the proof is complete. I 

Proof of Theorem |9] 

Let U and W be two functions with W) = 0. This means that G{U, [n]) and G(W, [n]) are 

equal to the same distribution Z„. Let H = [{Z„}^]^]. By Lemma l4. 101 with probability one there 
are two separable realizations (j)i,4'2 '■ X" 1-^ [0, IJ'd*^!) such that (j)^^{U), (f>2^{W) and H differ 
only in a zero measure set. Let A denote the separable sigma algebra generated by and 4>2- By 
the Euclidean correspondence (Theorem|6ll there is a separable realization 03 : X*^ 1-^ [0, 
corresponding to the algebra A. The maps (j>i and (/)2 define unique structure preserving maps V'l 
and 7/12 on the measure algebra £''(['=1) such that {(t>3)^^^i{S) is equivalent with (j)~^{S). This 
means that "01(1^) — ^2{W) in the measure algebra £''(['^1'. Therefore by Lemma |43] our theorem 
follows. I 

Proof of TheoremUl 

By the previous theorem, if 6w{U, V) = 0, then Si{U, V) = 0. On the other hand, if 5i{U,V) ^ 
then by the fact that 6^u iU,V) < di {U, V) and Lemmagl] 5^ {U,V) = 0. I 

4.5 The Counting Lemma 

Let C C be a symmetric combinatorial structure. Let y be a finite set. An {I, fc)-map is a 

function from r{V, k) to [I]. If E C_ V has /c-elements then the restriction of an {I, fc)-map f to E 
is an element x in [Z]''^^'. By specifying an arbitrary bijection g between E and [k] we can also 
represent x by an element x' in [Z]'"'!'^!). The S'fe -orbit of x' does not depend on g and so we can talk 
about the Sfc-orbit determined by the restriction of / to E. 

Let F be a fc-uniform hypergraph on V, let C C [/J'd'^l) be a symmetric combinatorial structure 
and let / be an {I, fc)-map on V. We say that / is a homomorphism form F to C if the restriction 
of / to any edge of F determines an Sk orbit which is in C. 
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The homomorphism density t{F, C) is the probability that a random {I, fc)-map on V" is a ho- 
momorphism. Note that here we take the uniform probabiUty distribution on all {I, fc)-maps. For 
technical reasons we will also need the number C, P) which is the probability that an {I, fc)-map 
chosen with distribution P on [/J''^^ '"') (the set of all (/, fc)-functions) is a homomorphism. 

Let Ti be an /-hyperpartition on a finite set U. Every injective map g : V t-^ U induces an 
(/, fc)-map fg on V such that for a set S* e r{V, k) the value f{S) is the index i of the partition set 
containing g{S). Let D{V, H) denote the probability distribution of fg if g is chosen uniformly 
at random from all the injective maps g : V U. Using this notation the following lemma follows 
immediately from the definitions. 

Lemma 4.11 Let C C [/J^d*^!) be a symmetric combinatorial structure and let H be a hypergraph 
on the set U which is the union ofTL-cells with coordinates in C. Then the probability t^{F, H) that 
a random injective map g :V ^ U is a homomorphism from F to H is equal to t{F, C, D{V, 'H)). 

Theorem 13 (Counting Lemma) Let {CAl^x increasing finite sets with l-hyperpartitions 
{^ili^i such that Tii is ei-regular and 6i-equitable with limi^oo = linii^oo Si = 0. Let fur- 
thermore C C be a symmetric combinatorial structure and Hi be the union ofHi-cells with 
coordinates in C. Then for every finite hypergraph F we have that 

lim t{F,Hi) = t{F,C). 

i — *C30 

Proof Let V denote the vertex set of F. Since {Ui}°^i is an increasing sequence of sets we have 
that 

lim t°{F,Hi) = lim t{F,H,). 

i — >oo i — >oo 

Now by Lemma P)-. Ill it suffices to show that Muii^oo P'{V,'Hi) is the uniform distribution on 

proceed by contradiction. By choosing an appropriate subsequence of {Ui}°Zi 
can assume that the limit of D{V, Hi) exists and it is not uniform. This means that there is a func- 
tion / : r{V, k) [I] such that 

lim Pi ^ , where pi = P{fg ^ f \ g : V t-^ Ui, g is injective) 

i — *oo 

holds. The set of all injective maps from V to Ui can be represented as the collection of elements in 
U^' with no repetitions in the coordinates. This subset in U^' has relative density tending to 1 as i 
goes to infinity. Now let Ti C defined by 

S£r(V.k) 

For S C r{V, k), tts -UY ^ is defined as Lp^ o L5, where Ls : ^ Uf is the natural 

projection and Lp^ is given by a bijection ps S ^ [[S*]]. Here P/gi denotes the corresponding 
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partition set in Tii. Since P^g,^ is symmetric in its coordinates the set Tj is independent of the 
concrete choice of the bijections ps- Let X denote the ultraproduct [{C/i}^i] and let H-'^ denote the 
ultraproduct [{T^}^]^] C X^. Furthermore for every S £ r(y, k) let denote the ultralimit of 
the partition sets T^s^iP^sp) fro^n TLi where i tends to infinity. Then 

H/= n 

S&(V.k) 

Also, the measure of H-^ is equal to Mra^pi = limi^ooK- Now the condition lim^^oo = 
limi^oo 5i = implies that for every S G r(V, k) the set H;^ has measure l^^ and that H;^ £ (^{S)* . 
The total independence theorem implies that the measure of H is providing a contradic- 

tion. I 

4.6 Equivalence of convergence notions and the Inverse Counting Lemma 

Let W C [0, be a hypergraphon. We say that a sequence of hypergraphs {Hi\°°^^ is struc- 
turally converges to W if for every ^-step hypergraphon U with 5i{W, U) < e and combinatorial 
structure C there is a sequence of ^-hyperpartitions Hi on the vertex sets of Hi such that 

1. is (5j-regular and (5j-equitable with limj_>oo Si — 

2. The union Ti of Hi-cells with coordinates in C satisfies lim supj_^oQ di {Ti,Hi) < e 

Definition 4.2 We say that an l-step hypergraphon U with combinatorial structure C is (e, 5)-close 
to a hypergraph H if there is an l-hyperpartition TC on the vertex set of H such that 

1. Ti. is both S-regular and S-equitable. 

2. The union ofTi-cells with combinatorial structure C satisfies di{H, Ty) < e. 

Theorem 14 For an increasing sequence {Hi}°^^ of k-uniform hypergraphs the following state- 
ments are equivalent: 

1. {Hi} is strongly convergent 

2. {Hi} is weakly convergent 

3. {Hi} structurally converges to a hypergraphon W which is also the weak limit of {Hi}. 

Proof. Let us start with (2) implies (3). By Theorem [7] we know that there is a hypergraphon 
W such that limi^oot{F, Hi) — t{F, W). Assume by contradiction that {Hi} is not structurally 
convergent to W. Then for some e > there is a (5 > 0, an Z-step hypergraphon U of combinatorial 
structure C with Si{U,W) < e and an infinite subsequence {Ji} of {Hi} such that none of the 
elements of {Ji} is {6, e + (5)-close to T. Let J be the ultra product hypergraph [{ Ji},^i] C X*^ 
and let </) : X*^ ^ [0, be a separable reaUzation of J. That is for some V C [0, IJ^'d'^D, 
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3A(j)-^{V) has measure zero. By LemmaEU S^,{V, W) = 0. By Theorem[TOl Si{V, W) = and 
consequently Si{U, V) < e. By Lemma \47f\ there exists a measure preserving equivalence p with 
di{p~^(U), V) < e + 5/2. This means that {p o 4>y^{U)A3 has measure at most e + 5/2. By 
Lemma l4~4l p o is a separable realization, hence {p o (p)^^{U) is a cell system with combinatorial 
structure C of a 0-regular and 0-equitable hyperpartition on X. This leads to a contradiction. 
The implication (3) => (1) is trivial. 

The implication (1) (2) follows from the Counting Lemma (Theorem [T3Tl. Let us fix a k- 
uniform hypergraph F on the vertex set V and with edge set E. According to the definition of 
strong convergence for every e > there is a fixed combinatorial structure C and modifications 
Hi of Hi with an at most e-density edge set such that every H^ is the union of the cells with 
coordinates in C of some hyperpartition which is getting more and more regular and balanced as i 
tends to infinity. The Counting Lemma implies that \imt^^ t{F, H[) = t{F, C). On the other hand 
\t{F, Hi) — t{F, H[)\ < \E\e. Using this inequality for every e > 0, we obtain the convergence of 
t{F, Hi). I 

The following immeadiate corollary states that if two hypergraphs have similar sub-hypergraph 
densities then they have similar regular partitions. 

Corollary 4.1 (Inverse Counting Lemma) Fix k > 0. Then for any e > there exist positive 
constants 5 = (5(e), (7 — C{t),N = N{e) such that if Hi, H2 are two k-uniform hypergraphs, 
\V{Hi)\ > N, \ V{H2)\ > N and di{Hi, H2) < 5, then there exists an l-hyperpartition U, 1 < I < 
C so that both hypergraphs are (e, e)-close to U. 

We also have a corollary of the Counting Lemma, using the notion of (e, (5)-closeness. 

Corollary 4.2 (Counting Lemma Finitary Version) For any finite k-uniform hypergraph F,l-step 
hypergraphon U and e > there is a constant 5 = 5{F, U, e) such that if a k-uniform hypergraph 
H is (S, 5)-close to U then 

\t{F,U)-tiF,H)\<e. 

(see also I^Wj ) 

5 The proof of the Total Independence Theorem 

Let {Xi}°^^ be finite sets as in Section[2]and fi : Xi [—d,d\ be real functions, where d > 0. 
Then one can define a function f : X ^ [— d, d] whose value at p = is the ultralimit of 

{fi{Pi)}l/Li- We say that f is the ultraproduct of the functions We shall use the notation 

f — Note that the characteristic function of the ultraproduct of sets is exactly the ultra- 

product of their characteristic functions. From now on we call such bounded functions ultraproduct 
functions. 
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Lemma 5.1 The ultraproduct functions are measurable on X and 



fd/j, = lim ■ 



Proof. Let —d < a < b < dhe real numbers. It is enough to prove that f[a {,] = {p G X | a < 
f{p) < b} is measurable. Let fi^^^-^ = {p e Xi \ a < fi{p) < b} . Note that is not 

necessarily equal to i[a,b] - Nevertheless if 

then Pn & V and f[a,b] = nj^jP„. This shows that f[a,b] is a measurable set. Hence the function f 
is measurable. 

Now we prove the integral formula. Let us consider the function gi on Xi which takes the value ^ 
if fi takes a value not smaller than ^ but less than for —Nk < j < Nk, where Nk = d2'^ + 1. 
Clearly | [{^i}^!] — f | < ^ on X. Observe that g = [{^iji^i] ^ measurable step-function on X 
taking the value ^ on Cj = [{/' ^ A"^-^]. Hence, 



2'= |Xi 



Also, |g — f I < ^ on X uniformly, that is \ J^i d/j, — g dp,\ < ^ . Notice that for any i>l 



2fc 



1 

< -u ■ 

- 2*= 



Therefore for each A; > 1, 



fdn- hm — 

X \Xi\ 



< 



2k-i ■ 



Thus our lemma follows. I 



Proposition 5.1 For every measurable function f : X — > [— d, d], there exists a sequence of func- 
tions fi : Xi ^ [— d, d] such that the ultraproduct of the sequence is almost everywhere 
equal to f . That is any element of L°°{K,Buj, can be represented by an ultraproduct function. 

Proof. Recall a standard result of measure theory. If f is a bounded measurable function on X, then 
there exists a sequence of bounded stepfunctions {/ifejfe^i such that 

• f = E^i^fc 

• \hk\ < 2F^,if fc > 1. 

• hk = En=i ^nXAl . where \J^t,-^A^ = X is a measurable partition, € M if 1 < n < rife. 
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Now let B^^ e V such that /^(A^ Ai?,^) = 0. We can suppose that U^^^B^^ is a partition of X. Let 
h'l^ = J2n=i ^nXB'' ™d f = X^fc^i '^fc- Then clearly f ' = f almost everywhere. We show that f is 
an ultraproduct function. 

Let B^ = [{Bn jli^il- We set Tk C N as the set of integers i for which Uj^*i;^i3^ ^ is a partition of 
Xi. Then obviously, Tj. G lo. Now we use our diagonalizing trick again. If i ^ Ti let = 0. If 
I e Ti, z e T2, . . . , z e Tfc, z ^ T^+i then define := E,ti(EnLi 4Xb^ ,) ■ If « ^ ^^fc ^r each 
k > 1 then set s, X^j^i ^Xs^ ^. ) • Now let p e Bj^ n B^ n . . . H B^^ . Then 

|(lim,s,)(p)-f'(p)| < 
Since this inequality holds for each fc > 1, f = [{s^}^]^]. I 

Leimna 5.2 Let A, B C [k] and let f : X^'^l be a a{B)-measurable ultraproduct function. 

Then for all y G X'^ the function iy is a{A n B)-measurable, where A'^ denotes the complement 
of A in [fc] and iy{x) — f (x, y). 

Proof Let f : Xl*^! ^ M be a (T(i3)-measurable ultraproduct function. Note that there exist 
functions /j : [^,] R depending only on the B-coordinates such that f is the ultraproduct 
of {fi]f^i. Indeed, let f be the ultraproduct of the functions gi. For x G Xi,B, let fi{x,t) := 

j j^^^^ I . Then fi depends only on the S-coordinates. Also by the integral formula of 

Lemma |5T| , lim^^ fi — f . Let y G X"^ , y — [{yi}'^i]- Then fy is the ultraproduct of the functions 
ff'. Clearly ff^ depends only on the A n B-coordinates, thus the ultraproduct iy is a{A D B)- 
measurable. I 

Proposition 5.2 (Fubini's Theorem) Let A C [fc] and let f : X^'^l R be a bounded cr([fc])- 
measurable function. Then for almost all y G X"^ , iy{x) is a measurable function on X.^ and the 
function 

y — * J-jj._4 iy{x)d^A{x) is X"^ -measurable. Moreover: 



i{p)dfi[k]{p) = J^^^ (^J^^ {y{x)dfiA{x)^ dfiA-iy) ■ 



Proof. First let f be the ultraproduct of {fi : R}iZi- Define the functions fi : 

[-d, d] by 

By Lemma ISTI 

\iia f,{y)= f{x,y)dfj.A{x) . 
Jy^A 

Applying Lemma lsTl again for the functions we obtain that 

limlX^^A^r^ fi{y) = / I / i{x,y)diiA{x)] d^XA<^{y) ■ 
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Then our proposition follows, since 

Now let f be an arbitrary bounded (T([fc]) -measurable function. Since there exists an ultraproduct 
function g that is a zero measure perturbation of f it is enough to prove the following lemma: 

Lemma: Let Y C Xl*^! be a measurable set of zero measure, then for almost all y G X"^ , 

{x e X^ I X-^ X y e y} 

has measure zero. 

Proof: Since y is a set of zero measure, there exists sets Z„ G T^\k\ such that 
• Y C Z„. 

Let Ln C X'^ be the set of points y in X"* such that 

MA({xeX^ I x^x2/ez„})>i^. 

Since Fubini's Theorem holds for ultraproduct functions it is easy to see that /i^c (i„) < Thus 
by the Borel-Cantelli Lemma almost all y G X"* is contained only in finitely many sets L„. Clearly, 
for those y, {x G X'^ | X"^ x y £Y} has measure zero . I 

Proposition 5.3 (Integration Rule) Let gi : Xl'^l R be bounded a{Ai)-measurable functions 
for i = 1, 2, . . . , m. Let B denote the a-algebra generated by a{Ai H ^2), a{Ai flAa), . . . , a{Ai n 
Am)- Then 



9192 ■■■9,n dfi[k] = / E{gi\B)g293 ■ ■ ■ 5™ dii[k\ ■ 
Proof. First of all note that E{gx \ B) does not depend on the -coordinates. By Fubini's Theorem, 

/ 919293 ■■ ■9m dtJ.[k]= (/ gi{x)g2ix,y) . . .gmix,y)dfiAi{x)] df^Aiiy) ■ 

Now we obtain by Lemma lS!2] that for all y G X'^i the function 

X g2{x,y)g3{x,y) . ..gm{x,y) {x G X-^^) 
is i?-measurable. This means that 

9i{x)92{x,y) . ..gm{x,y)diiAAx) = 



E{gi\B)(x)g2{x,y)g3{x,y) . . . g,n{x,y)dnAi{x) 



for all y in X^ij . This completes the proof. I 
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Now we finish the proof of the Total Independence Theorem. We can assume that \Ai\ > \Aj\ 
whenever j > i. Let Xi be the characteristic function of Si. We have that 

ii{Si n 5*2 n . . . n = / xiX2 • • • Xrdnik] ■ 

The Integration Rule shows that 

/ XiXi+i ■ ■ ■ Xr diJ.[k] = E{xi\<^{Ai)*)xi+i---XrdiJ.[k] 

= KSi) / Xi+lXi+2 ■ ■ ■ Xr dlJ,[k]. 

A simple induction finishes the proof. I 



6 The proof of the Euclidean Correspondance Principle 
6.1 Random Partitions 

The goal of this subsection is to prove the following proposition. 

Proposition 6.1 Let A c [k] be a subset, then for any n > 1 there exists a partition = 
5i U 52 U . . . U such that E{Si \ a{A)*) = i. 

Proof. The idea of the proof is that we consider random partitions of X"* and show that by probabil- 
ity one these partitions will satisfy the property of our proposition. Let fl = Ili^iil: 2, . . . , n}'^*-^ 
be the set of {1,2,..., n}-valued functions on [J°ZiXi^A- Each element / of O defines a partition 
of Xa the following way. Let 

Sy = {pGXi^A \f{p)=j} ,l<j<n,i>l. 

[{sy}r^,\ = sj. 

Then = Sj U U ... U is our partition induced by /. 
Note that on Q one has the usual Bernoulli probabiUty measure P, 

where 

Tpi,p2,...,prih,i2,---,ir) = {f I f{Ps) =is 1 < s < r}. 

A cylindric intersection set T in Xi^A is a set T = r\c,cgATc, where Tc C Xi^c- First of all note 
that the number of different cyhndric intersection sets in Xj^^ is not greater than 

-Q 2l^.,cl < 2(l^.l''*'"')2\ 

CCCA 
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Let < e < be a real number and T be a cylindric intersection set of elements of size at 
least e|Xi_^| . By the Chernoff-inequality the probability that an / G fi takes the value 1 more 
than (i + e)|r|-times or less than (i — e)|T|-times on the set T is less than 2exp{—c^\T\), where 
the positive constant depends only on e. Therefore the probability that there exists a cylindric 
intersection set T C Xi,A of size at least e|Xi.yi| for which / G il takes the value 1 more than 
(i + e)|r|-times or less than (i — e)|r|-times on the set T is less than 

2(|x.l'-^'-)2'=2exp(-c.e|X,ri). 

Since |^i | < \X2 1 < . . . by the Borel-Cantelli lemma we have the following lemma. 

Lemma 6.1 For almost all f €z the following holds: If e > 0, then there exist only finitely many 
i such that there exists at least one cylindric intersection set T C Xi ^for which f G takes the 
value 1 more than (i + e)\T\-times or less than — e)\T\-times on the set T. 

Now let us consider a cylindric intersection set Z C X"^, Z ~ ^^cccaZc, Zc C X*-^. By the 
previous lemma, for almost all f E ft, 

^i{s} nz) = -^liz) . 

■' n 

Therefore for almost all f € ft: 

n{S}r^z') = -{^,{Z')), 

■' n 

where Z' is a finite disjoint union of cylindric intersection sets in X '^. Consequently, for almost all 

^,is}nY) = -i^,{Y)), 

' n 

where Y G cr(A)*. This shows immediately that E{Sj \ (j{A)*) = i for almost all f e fl. 
Similarly, E{S'j: \ a{A)* ) = i for almost all f £ il, thus our proposition follows. I 

Remark: Later on we need a simple modification of our proposition. Let {f/i}"^! be non-negative 
real numbers, such that J27=i 1i ~ ^- Repeat the construction of the measure on D, as in Proposition 
16. II with the exception that for any p G Xi^A the probability that f{p) — i is qi instead of i. Then 
with probability one £^(5'} | (t{A)*) = qi. 

6.2 Independent Complement in Separable cr-algebras 

Let ^ be a separable cr-algebra on a set X, and let ^ he a probability measure on A. Two sub 
cr-algebras B and C are called independent if DC) = ^{B)ii{C) for every B £ B and C G C. 
We say that C is an independent complement of ;B in ^ if it is independent from B and {B., C) is 
dense in A. 
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Definition 6.1 Let A > B be two a-algebras on a set X and let fi be a probability measure on A 
A B-random k-partition in A is a partition Ai^ A2, . . . , Ak of X into A-measurable sets such that 
E{Ai\B) = 1/kfor every i = l,2,...,k. 

Tlieorem 15 (Independent Complement) Let A > B be two separable a-algebras on a set X 
and let fi be a probability measure on A Assume that for every natural number k there exists a 
B-random k-partition {Ai^k, ^2,fc, • ■ • , ^fc,fc} in A. Then there is an independent complement C of 
B in A (Note that this is basically the Maharam-lemma, see [11 ]) 

Proof. Let 5i , 6*2 , . . . be a countable generating system of A and let Vk denote the finite Boolean 
algebra generated hy Si, S2, ■ ■ ■ , Sk and {Aij\i < j < k}. Let VJl denote the atoms of Vk- It is 
clear that for every atom R ^ V^we have that E{R\B) < l/k because R is contained in one of the 
sets Ai^k, ^2,fe, • ■ • , Ak^k- During the proof we fix one ,B-measurable version of E{R\B) for every 
R. The algebra Pk is a subalgebra of Pk+i for every k. Thus we can define total orderings on the 
sets VI such a way that if i?2 e V^ with i?i < R2 and R3, R4 G Vl_^_^ with -R3 C i?4 C i?2 
then R3 < i?4. We can assume that J2ReV' ^(^' ^){^) = 1 for ^"Y element x G X. It follows 
that for k £ N, X E X and A G [0, 1) there is a unique element R{x, A, k) G V^ satisfying 

^ E{R\B)ix) < A 

R<R(x,\,k) 

and 

^ E{R\B){x) > A. 

R<R{x,\.k} 

For an element i? G "P^ let T{R, A, k) denote the set of those points x £ X for which R{x, A, k) = 
R. It is easy to see that T{R, A, k) is ;B-measurable. Let us define the A-measurable set S{X, k) by 

S{\k)= U (T(i?,A,fc)n(Ufl,,<fli?2)) 

RdVl 

and S"(A,fc) by 

5'(A,fc)= y (T(i?,A,fc)n(Ui?,<fli?2)). 

Note that 

S{\k)^{xEX\ E{R\B){x) <X}, 

R:2<Rk(x) 

where Rk (x) is the element of VI. that contains x. 

Proposition 6.2 (i) A - i < E{S{\, k) \ B)(x) < Xfor any x e X. 

(ii) Ifk < t, then S{X, k) C S'(A, t) C S"(A, k) . 

(iii) E{S'{\,k)\S{X,k) \ B){x) < ^foranyxe X. 
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Proof. First observe that 

\-\< E E{R\B){x)<\, 

R<R{x,\,k) 

for any x ^ X. Also, we have 

S{x,k)= y (i?nr(i?i,A,/c)), s"(A,fc)= y {RnT{Ri,X,k)). (6) 

R,Ri£V^,R.<Ri R,RieVlM<Ri 

That is by the basic property of the conditional expectation: 

E{S{\ k)\B)^ I ^)XT(RuX,k) ■ 

R,RieV^,R<Ri 

That is 

E{S{X,k)\B){x)^ E{R\B){x). (7) 

R<R{x,\,k) 

and similarly 

E{S'{X,k)\B){x)^ J2 E{R\B){x). (8) 

R<R(x,X,k) 

Hence (i) and (iii) follows immediately, using the fact that E{R' \ B) < ^ for any i?' e "P^. 
Observe that for any R e Vl, T{R, A, k) = UR'cR,R'ev;T{R' , A, t) . Hence 

y (i?nr(i?i,A,fc)) c y {R' r\T{R[,x,t)) c 

R,Ri&ri,R<Ri R',R[eV*M'<R[ 

c y (i?nr(i?i,A,A:)) 

ij,fliGP;,ii<iii 

Thus (|6]l implies (ii) . I 

Lemma 6.2 Lef S'(A) = Uf^^S{X, k) . Then ifX2 < Ai, then S{X2) C 5(Ai). 

Proof. Note that x G S'(A2, fc) if and only if a; G i?2 for some i?2 < A2, fc) . Obviously, 
R{x, X2,k) < R(x, Ai, fc), thus x G S'(Ai, /c). Hence S'(A2) C 5(Ai) I 

Lemma 6.3 E{S{X) \B) = X. 

Proof Since xs(XM) ^'^-^'"^ Xs(a), we have i?(5(A, fc) | B) iJ(5(A) | B). That is by (i) of 

Proposition|62]-E(S'(A) \B) = X. I 

The last two lemmas together imply that the sets S{X) generate a cr-algebra C which is independent 
from B. 

Now we have to show that B and C generate A. Let S EVk for some fc G N. We say that 5 is an 
interval if there exists an element i? G such that S ~ Ur-^kuRi- It is enough to show that any 
interval S EVk can be generated by B and C. 
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Suppose that {Tt}^^ be sets in {B,C) such that C 5 and \\E{S \ B) - E{Tt \ B)\\ tends 
uniformly to as i — > oo. Then fi{S\Tt) as t ^ oo, that is B and C generate S. Indeed, 

^iiS\Tt) - / (xs - XtJ = / (EiS I B) - E{T, \ B)) . 

So let t > fc be an arbitrary natural number It is clear that S is an interval in Vt- For a natural 
number < d < t — I let Fd denote the B-measurable set on which E{S\B) is in the interval 
(f, ^^]- Now we approximate S by 

T,= [j{FdnS{^))&{B,C). 

d=0 



Lemma 6.4 Tt C S. 

Proof. It is enough to prove that n S'(|, /c) C S for any 0<d<t-l,t<k. Observe that 

Fd^{xeX\^< J2 mi I B){x) < 

Ri<R 



and 



S{-^,k)^{xeX \ E{R2\B){x)<^}. 



Thusif a; e n5(f ,A:) thenx e S*. I 
Lemma 6.5 For any x E X, 



\E{S\B){x)~EiTt\B){x)\<- 



Proof. First note that by Proposition 16 .2 1 (iii) 



E{Si^)\B)ix)-E{S{P)\B)ix) 



< 



(9) 



Note that 



t-i 



E{T,\B){x)=Y,XfA^)E{S{-)\B){2 



d=0 



Suppose that x e Fd. Then by ^ and 



E{Tt I B){x) - m' I ^)(^ 

R'<R{x,j,t) 



< - . 
- t 



On the Other hand F(S' | B){x) = Y.R'<rE{R' \ B){x) and f < Y.R'<rE{R' \ B){x) < ^ . 
That is 



\E{S I B)ix)-E{Tt I B)ix)\ < 
The Theorem now follows from Lemma l63] immediatelv. I 



t 



I 
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Definition 6.2 Let (X, A, /i) be a probability space, and assume that a finite group G is acting on 
X such that A is G-invariant as a set system. We say that the action of G is free if there is a subset 
S of X with fJ.{S) = 1/|G| such that S^^ fl S^^ — whenever gi and g2 are distinct elements of G. 

We will need the following consequence of TheoremfTSI 

Lemma 6.6 Let A > B be two separable a-algebras on the set X and let fi be a probability 
measure on A Assume that a finite group G is acting on X such that A, B and fi are G invariant. 
Assume furthermore that the action of G on {X, B, fi) is free and for any k > 1 there exists a B- 
random k partition of X in A . Then there is an independent complement C in A for B such that C 
is elementwise G-invariant. 

Proof. Let G be a set showing that G acts freely on B. Let A\s and B\s denote the restriction 
of A and B to the set S. It is clear that if {Ai, A2., . . . , Ak} is a S-random fc-partition in A then 
{S n Ai, S n A2, . . . ,S n Ak} is a B|s-i"andom k partition in A\s- Hence by Theorem [Tsl there 
exists an independent complement Ci of B\s in A\s- The set 

g&G 

is a (T-algebra because the action of G is free. Note that the elements of C are G-invariant. Since 
E{UgeGH^\B) = J2geG ^Is)^ we obtain that the elements of C are independent from B. It 
is clear that (C, B) is dense in A. I 

6.3 Separable Realization 

In this subsection we show how to pass from nonseparable cr-algebras to separable ones. 

First note that the symmetric group Sk acts on the space X'^' by permuting the coordinates: 

{xi,X2,...,Xky = 1^-1(2),..., a;^-i(fc)). 

The group also acts on the subsets of [k] and cr(A)'^ = a{A'^), where A" denotes the image of the 
subset A under tt G 5fe. We will denote by Sa the symmetric group acting on the subset A. 

Definition 6.3 A separable system on X'^ ,r < k is a system of atomless separable a-algebras 
{1{A) I A G and functions {Fa ■ X*"' — > [0, 1] | A G with the following properties 

L 1{A) is a subset of a{A) and is independent from a{A)* for every 7^ A C [k]. 

2. 1{AY = 1{A'^) for every permutation tt G Sk. 

3. — S for every S G I (A) andn G Sa- 

4. Fa is an l{A)-measurable function which defines a measurable equivalence between the mea- 
sure algebras of (X.'^, 1{A), 11^) and [0, 1]. (see Appendix) 
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5. Fa{^) — Fa^ {^^) for every element X G X'' , n E Sk and A C [k]. 

The main proposition in this section is the following one. 

Proposition 6.3 For every separable a-algebra A in <j{[k\) there exists a separable system such 
that for every set M E A there is a set Q e {1{A) \ A £ r{[k])) with fi^^] {M AQ) = 0. 

This proposition immediately implies Theorem |6] since the map F : X'^ ^^ [0, whose 
coordinate functions are {Fa \ A € r{[k])} constructed in Proposition l6.3l is a separable realization. 
We will need the following three lemmas. 

Lemma 6.7 Let B ^ A two a-algebras on a set Y, and let ^ be a probability measure on A. Then 
for any separable sub-a-algebra A of A there exists a separable sub a-algebra B of B such that 
E{A\B) = E{A\B) for every AeA 

Proof. We use the fact that ^ is a separable metric space with the distance d{A, B) — ii{AAB). Let 
W = {Di, D2, ■ ■ ■}hea countable dense subset of ^ with the previous distance. Let Cp ^ ^ E{Di \ 
B)^^{p,q), where p < q are rational numbers. Clearly, E{Di \ B) is a ;Bi-measurable function, 
where = {C'^ g \ p < q e Q). Obviously, E{D, \ B) = E{D., \ B) for any i > 1, where 
B ^ {Bi \ i ^ 1,2,...) . Now observe that E{D, \ B) ^ E{D, B) and E{Di \ B) ^ E{D,B) if 
D, D. Hence for any D eA, E{D \ B) = E{D \ B). I 

Lemma 6.8 Let A C [k] be a subset and assume that there are atomless separable a-algebras 
d{{i}) C a{{i}),i £ A such that d({i})'^ = d{{i'^}) for every i G A and n £ Sa- Then S a acts 
freely on {d{{i})\i G A). 

Proof. The permutation invariance implies that there exists a cr-algebra ^ on X such that P^^ (A) = 
d{{i}) for every i £ A. Let F : X ^ [0, 1] be a ^-measurable measure preserving map. Now we 
can define the map G : X^ -> [0, 1]^ by 

G[xi^ , , . . . , ) := {F[xi^ ) , F^Xi^ ), • ■ . , -^(2^11^1 ))■ 

Let us introduce 5" := {{yi,y2, . . . ,yr)\yi < y2 < ■ ■ ■ < Vr} C [0,1]^ and G-^{S'). 
Clearly /i^ (S") = 1 / 1 A [ ! and 5"" n S''' = for every two different elements n ^ pinSA- I 

Lemma 6.9 Let k be a natural number and assume that for every A C [k] there is a separable 
a-algebra c{A) in a{A). Then for every A C [k] there is a separable a-algebra d{A) in a(A) with 
c{A) C d{A) such that 

L E{R\{d{B)\B G A*)) ^ E{R\a{A)*) whenever R £ d{A). 

2. d{AY = d{A^) for every element TT £ S}.. 

3. d{B) C d{A) whenever B C A 
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Proof. First we construct algebras d'{A) recursively. Let (i'([fc]) be (c([fc])'^|7r G Sk)- Assume 
that we have already constructed the algebras d'{A) for |^| > t. Let A C [k] be such that |^| = 
t. By Lemma Wn\ we can see that there exists a separable subalgebra d'{A) of cr{A)* such that 
E{R\a{AY) = E{R\(f(A)) for every R e d'{A). Since a{A)* is generated by the algebras 
{a{B)\B G A*} we have that every element of <j{A)* is a countable expression of some sets in 
these algebras. This implies that any separable sub cr-algebra of (j{A)* is generated by separable 
sub (T-algebras of the algebras a{B) where B ^ A* . In particular we can choose separable a- 
algebras d'{A, B) D c{B) in a{B) for every B e A* such that {d'{A, B)\B e A*) D d!{A). For 
a set B C [k] with \B\ = f — 1 we define d'{B) as the a-algebra generated by all the algebras in 
the form of d'(C, DY , where -k Sk , D"" = B , \C\ = \D\ + 1 and £> C C. Since d'(C, DY C 
(t{DY = cr(B) we have that d'{B) C (j{B). Furthermore we have that d' {BY =d' {B"" ) for every 

Now let d{A) := {d'{B) \ B C A), the second requirement in the lemma is trivial by definition. 
We prove the first one. The elements of d{A) can be approximated by finite unions of intersections 
of the form PIbca where Tb G d'{B) and so it is enough to prove the statement if R is such an 
intersection. Let Q = HbcA b^a "^b- Now 

E{R\{d{B)\B e A*)) = E{R\{d'{B)\B c A,B A)) . 
By the basic property of the conditional expectation (see Appendix) : 

EiR\{d'iB)\B CA,B^A))= E{TA\{d' (B)\B cA,B^ A))xq = £;(T^|a(A)*)xQ = 

= E{R\a{A)*). 

I 

Proof of Proposition 16.31 We construct the algebras 1{A) in the following steps. For each non- 
empty subset A C [k] we choose an atomless separable cr-algebra c{A) C (t{A) containing a 
(T (A)* -random r-partition for every r. We also assume that A C c([/c]). Applying Lemma \6M 
for the previous system of separable cr-algebras c{A) we obtain the cr-algebras d{A). By Lemma 
16.81 and the permutation invariance property of the previous lemma, S[,.] acts freely on = 
{d{B)\B G [r]*). Hence using Lemma |631 for every 7^ A G [A:] we can choose an independent 
complement l{[r]) for ^([7-])* in d{[r]) such that l{[r]) is elementwise invariant under the action of 
S[r]- The algebras ;([r]) are independent from cr([r])* since ^{R) = E{R\d{[r]Y) = E{R\a{[r])*) 
for every R G l{[r]). Now we define 1{A), where \A\ r by 1{A) — l{[r]Y for some tt G Sk, 
7r([r]) = A. Note that I {A) does not depend on the choice of tt. By Lemma ITTI of the Appendix 
we have maps : X'' [0, 1] such that F^^ defines a measure algebra isomorphism between 
M{[0, l],B, A) and A^(X'' J[r],/i''). Let Fa = n^^oF^r], where tt maps [r] to A. Again, Fj^j does 
not depend on the particular choice of the permutation tt. I 



45 



7 Appendix on basic measure theory 



In this section we collect some of the basic results of measure theory we frequently use in our paper. 

Separable measure spaces: Let {X, A, /i) be a probability measure space. Then we call A, A' € 
A equivalent if /i(AAA') = 0. The equivalence classes form a complete metric space, where 
[B]) ~ fj,{AAB) . This classes form a Boolean-algebra as well, called the measure algebra 
M{X, A, fi). We say that {X,A,pi) is a separable measure space if Ai{X, A, fi) is a separable 
metric space. It is important to note that if {X, A, ii) is separable and atomless, then its measure 
algebra is isomorphic to the measure algebra of the standard Lebesgue space ( [0, 1] , B, A), where B 
is the cr-algebra of Borel sets (see e.g. fS^). We use the following folklore version of this theorem. 

Lemma 7.1 If (X, A, ^) is a separable and atomless measure space, then there exists a map f : 
X [0,1] such that f^^{B) C A, fi{f^^{U)) = \{U) for any U e B and for any L e A there 
exists M € B such that L is equivalent to f^^{M). 

In other words, if F : [0, 1] ^ X is an injective measure preserving measure algebra homomor- 
phism such that the image of the the Borel-algebra is just A, then F can be represented by the map 
f. That is for any measurable set U C [0, 1], F{U) is the set representing f~^(U). 

Proof Let Iq denote the interval [0, 5], h = Then let /o,o = [0,j], hs = [^,5], 

h.o = [57 1]' ~ [jA]- Recursively, we define the dyadic intervals Iai,a2,...,ak^ where 
(ai, a2, . . . , Qifc) is a — 1-string. Let T be the Boolean-algebra isomorphism between the mea- 
sure algebra of {X,A,iJi) and the measure algebra of ([0, 1],;B, A). Then we have disjoint sets 
[/o, Ui & A such that T{[Un]) = [h], T{[Ui]) ^ [h]. Clearly ^i{X\{Un U Ui) = 0. Similarly, 
we have disjoint subsets of Uo, Uq.q and C/0,1 such that T([C/o,o]) = [^0,0] and T{[Uo.i]) = [Io,i]. 
Recursively, we define Uai,a2:...,ak £ such that Uai,a2,...,ak-i,o and ?7ai,a2,....Qfc-i,o are disjoint 
and T{[Uai. a2,....ak]) = ^Qi.Q2,--,ctfc- For any fc > 0, the set of points in X which are not included 
in some Uai,a2,...,ak has measure zero. Now define 

f{p) :— ri^ilai,a2,---,ak: ) 

where for each fc > l,p 6 Uai,a2,...,ak- It is easy to see that / satisfies the conditions of our lemma. 
I 

Generated cr-algebras: Let {X, C, fi) be a probability measure space and ^1, ^2, • ■ • , Ak be sub- 
fj-algebras. Then we denote by {Ai | 1 < « < fc) the generated cr-algebra that is the smallest 
sub-cr-algebra of C containing the AiS. Then the equivalence classes 

[u7^i(^in^^2n...nAi)], 

where Al e ^4.^ and (Af n A| n . . . n A|) n n A| n . . . n ^1) = if s ^ t form a dense subset 
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in the measure algebra A4 {X, {Ai \ I < i < k), fi) with respect to the metric defined above (see 

S). 

Independent subalgebras and product measures: The sub-cr-algebras Ai, A2, ■ ■ ■ , Ak C C are inde- 
pendent subalgebras if 

n{Ai)^i{A2) . . . n{Ak) = ^x{Al n n . . . n Afc) , 

if Ai e Ai. 

Lemma 7.2 Let Ai, A2, ■ ■ ■ , Ak G C be independent subalgebras as above and fi : X ^ [0, 1] 
be maps such that f^^ defines isomorphisms between the measure algebras A4(X,Ai, ji) and 
A4{[Q, 1], B, A). Then the map F^^, F = ©f^i/i : X — > [0, l]'^ defines an isomorphism between 
the measure algebras M{X, {Ai \ 1 <i < k),^) and M{[0, 1]*^, ,\^). 

Proof. Observed that 

^l{F-\yJU[^, X ... X A^)) = ^ A'=[Al X . . . X Ay 

i=l 

whenever {A\ x . . . x A\,Yi=i ^re disjoint product sets. Hence defines an isometry between 
dense subsets of the two measure algebras. I 

Radon-Nykodym Theorem: Let (X, A, p) be a probability measure space and v be an absolutely 
continuous measure with respect to /i. That is if ii{A) — then v{A) = as well. Then there exists 
an integrable yl-measurable function / such that 

p{A)^ / fdii 

J A 

for any A ^ A. 

Conditional expectation: Let (X, A, p) be a probability measure space and B C be a sub-tr- 
algebra. Then by the Radon-Nykodym-theorem for any integrable ^-measurable function / there 
exists an integrable B-measurable function E{f \ B) such that 

/ E{f I B)d^^ = / fdp, 

J B J B 

if B E B. The function E{f \ B) is called the conditional expectation of / with respect to B. It is 
unique up to a zero-measure perturbation. Note that if a < f{x) < b for almost all x E X, then 
1 E{f I B){x) < b for almost all x G X as well. Also, if g is a bounded Z?-measurable function, 
then 

E{fg I B) = E{f I B)g almost everywhere . 
The map / E{f, B) extends to a Hilbert-space projection E : L'^{X, A, fi) L^{X, B, p). 
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Lebesgue density theorem: Let A E M" be a measurable set. Then almost all points x E A is sl 
density point. The point x is a density point if 

Vol(BJx)nA) 
lim -— - — = 1 , 

r^O Vol{Brix)) 

where Vol denotes the n-dimensional Lebesgue-measure. 

Coupling: Let A, B are sets. Let X be an A-valued random variable and y be a S-valued random 
variable. A coupling of X and F is a A x B-valued random variable Z, such that the first component 
of Z has the distribution of X and the second component of Z has the distribution of Y . 
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